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For addressing optimisation tasks on finite dimensional quantum systems, we give a comprehensive 
account on the foundations of gradient flows on Riemannian manifolds including new applications 
to quantum control: we extend former results on unitary groups to closed subgroups with tensor- 
product structure, where the finest product partitioning consists of purely local unitary operations. 
Moreover, the framework is kept sufficiently general for setting up gradient flows on (sub-)manifolds, 
Lie (sub-)groups, and (reductive) homogeneous spaces. Relevant convergence conditions are dis- 
cussed, in particular for gradient flows on compact and analytic manifolds. This part of the paper 
is meant to serve as foundation for some recent and new achievements, and as setting for further 
research. 

Exploiting the differential geometry of quantum dynamics under different scenarios helps to pro- 
vide highly useful algorithms: (a) On an abstract level, gradient flows may establish the exact upper 
bounds of pertinent quality functions, i.e. upper bounds reachable within the underlying manifold 
of the state space dynamics; (b) in a second stage referring to a concrete experimental setting, 
gradient flows on the space of piecewise constant control amplitudes in R m may be set up to yield 
(approximations to) optimal control for quantum devices under realistic conditions. 

fllustrative examples and new applications are given, such as figures of merit on the subgroup 
of local unitary action on n qubits relating to distance measures of pure-state entanglement. We 
establish the correspondence to best rank-1 approximations of higher order tensors and show 
applications from quantum information, where our gradient flows on the subgroup of local unitary 
operations provide a numerically stable alternative to tensor-SVD techniques. 
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I. INTRODUCTION 

Controlling quantum systems offers a great potential 
for performing computational tasks or for simulating the 
behaviour of other quantum systems (which are diffi- 
cult to handle experimentally) or classical systems [H, Q , 
when the complexity of a problem reduces upon going 
from a classical to a quantum setting Q. Important ex- 
amples are known in quantum computation, quantum 
search and quantum simulation. Most prominently, there 
is the exponential speed-up by Shor's quantum algo- 
rithm of prime factorisation [J, Q, which on a general 
level relates to the class of quantum algorithms @, @| 
solving hidden subgroup problems in an efficient way 
Q. In Grover's quantum search algorithm [{J [l(| one 



still finds a quadratic acceleration compared to classical 
approaches [111 ]. Recently, the simulation of quantum 
phase-transitions [l2j has shifted into focus [H, E3| ■ 

Among the generic tools needed for advances in quan- 
tum technology, for a survey see, e.g., [TB|, quantum con- 
trol plays a major role. Its key concern is not only to find 
(optimal) control strategies for quantum dynamical sys- 
tems such that a certain performance index is maximised 
(typical examples being quantum gate fidelities, efficien- 
cies of state transfer or coherence transfer, as well as dis- 
tances related to Euclidean entanglement measures) but, 
moreover, to develop constructive ways for implementing 
controls under realistic experimental settings. Usually, 
such figures of merit depend on terminal conditions and 
running cost, like time or energy, cf. Section [TTJ In quan- 
tum control, however, important classes of performance 
indices are completely determined by some quality func- 
tion and its value at the system's final state. 



Since realistic quantum systems are mostly beyond an- 
alytical tractability, numerical methods are often indis- 
pensable. A good strategy is to proceed in two steps: 
(a) firstly, by exploring the possible gains on an abstract 
and computationally cheap level, i.e. by maximising the 
quality function cither over the entire state space or over 
the set of possible states — the so-called reachable set; (b) 
secondly, by going into optimising the experimental con- 
trols ('pulse shapes') in a concrete setting. However, (b) 
is often computationally expensive and highly sensitive, 
as it actually consists of solving an infinite dimensional 
constrained variational problem. 

By merely depending on the geometry of the underly- 
ing state-space manifold the first instance (a) allows for 
analysing in advance and on an abstract level the limits 
of what one can achieve in step (b). We therefore re- 
fer to (a) as the abstract optimisation task. The second 
instance, in contrast, hinges on introducing the specific 
time scales of an experimental setting for finding steer- 
ings of the quantum dynamical system such that the op- 
tima determined in (a) are actually assumed. This is why 
we term (b) the dynamic optimal control task. Certainly, 
one can approach the entire problem only in terms of (b) 
and sometimes one is even forced to do so, e.g. if nothing 
is known about the geometric structure of the reachable 
sct. Yet, the above two-fold strategy may serve to yield a 
benchmark in (a) forjudging the reliability of the numeri- 
cal results of (b). In both regards, gradient-flow methods 
will prove to be particularly useful. 

In a pioneering paper Brockett introduced the 

idea of exploiting gradient flows on the orthogonal group 
for diagonalising real symmetric matrices and for sorting 
lists. In a series of subsequent papers he extended the 
concept to intrinsic gradient methods for (constrained) 
optimisation [iji [l8| . Soon these techniques were gener- 
alised to Riemannian manifolds, their mathematical and 
numerical details were worked out [l9|, [2(| and thus 
they turned out to be applicable to a broad array of op- 
timisation tasks including eigenvalue and singular-value 
problems, principal component analysis, matrix least- 
squares matching problems, balanced matrix factorisa- 
tions, and combinatorial optimisation — for an overview 
see, e.g., [2l|,[22j|. Implementing gradient flows for opti- 
misation on smooth manifolds, such as unitary orbits, 
inherently ensures the discretised flow remains within 
the manifold by virtue of Riemannian exponential map. 
Alternatively, formulating the optimisation problem on 
some embedding Euclidean space comes at the expense 
of additional constraints (e.g. enforcing unitarity) to 
be taken care of by Lagrange-type or penalty-type tech- 
niques. In this sense, gradient flows on manifolds are in- 
trinsic optimisation methods p3j . whereas extrinsic op- 
timisations on the embedding spaces require non-linear 
projective techniques in order to stay on the (constraint) 
manifold. 

Using the differential geometry of matrix manifolds has 
thus become a field of active research. For new develop- 
ments (however without exploiting the Lie structure to 
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the full extent) see, e.g., [24J. — Even beyond manifolds, 
gradient flows have recently been described for metric 
spaces with applications of probability theory [2f| . 

Meanwhile, gradient flows and their discrete numeri- 
cal integration schemes have also proven powerful tools 
of optimisation in quantum dynamics. This holds in both 
of the two types of tasks: (a) for exploring the maxima 
of pertinent quality functions on the reachable set of a 
quantum system, e.g. on the unitary group and its orbits 
[26l . [27l . l28l | and (b) for arriving at concrete experimen- 
tal steerings (i.e. 'pulse shapes') actually achieving the 
quality limits established in (a) under given experimen- 
tal conditions for closed [1^, [3y, [3l| or open [32, [H, [HJ 
systems. Recently we thus gave upper limits for time 
complexities of implementing unitary modules [30| . or 
for exploring the potential of quantum compilation with 
complex instruction sets [3^ . 

Moreover, in view of using gradient techniques for uni- 
fying variational approaches to ground-state calculations 
|36j | . it will be useful to exploit a common framework 
of gradient flows on Ricmannian manifolds as well as 
projective techniques on their tangent spaces. To this 
end, we also show how gradient flows can readily be re- 
stricted, e.g., from Lie groups G to any closed subgroup 
H, in particular any closed subgroup of SU(N). Since 
quantum dynamics often takes place in a subspace of the 
entire Hilbert space so that long-range entangling corre- 
lations can be neglected on the basis of area laws (see, 
e.g. [13, [H, H^]), truncating the Hilbert space to the per- 
tinent subspaces is tantamount to representing dynamics 
of large systems. For instance, unitary networks use con- 
secutive partitionings into different subgroups that can 
be applied to efficiently compute ground states of large- 
scale quantum systems with a cost increasing polyno- 
mially with system size while retaining sufficiently good 
approximations. Current approaches of truncating the 
full-scale Hilbert space into the according subspaces in- 
clude matrix-product states (MPS) [I(J, fill of density- 
matrix renormalisation groups (DMRG) |42l . |43| , quan- 
tum cellular automata with Margolus partitionings [44| . 
projected entangled pair states (PEPS) [45[ weighted 
graph states (WGS) [46|, multi-scale entanglement renor- 
malisation approaches (MERA) [47[, string-bond states 
(SB S) | 48j as well as combinations of different techniques 
[36l . |49(. - It is noteworthy, however, that in many- 
particle physics gradient flows for diagonalising Hamil- 
tonians were re-introduced independently of Brockett's 
work [l6} by Wegner [5(j and were further elaborated 
on again independently of the monography by Helmke 
and Moore [21( or the one by Bloch [23] in the tract by 



Kehrein [5l|. Suffice this to illustrate the need for mak- 



ing the mathematical methods available to the physics 
community in a comprehensive way. 

Another field of applications of restricting flows to 
closed subgroups of SU(N) is entanglement of multi- 
partite quantum systems (53 . [53j] : we present a connec- 
tion from gradient flows on the subgroup of local unitary 
operations to best rank-1 approximations of higher or- 



der tensors as well as a relation to tensor-SVDs. They are 
of importance, e .g., in view of optimisation of entangle- 
ment witnesses [54| . Gradient flows on partitionings of 
the full unitary group are anticipated to be of use also 
for classifying multi-partite systems according to their 
mutual separability, an example being three-tangles of 
GHZ-type and W-type states H, [H, [57} . 

Moreover, with the framework of treating gradient 
flows on Ricmannian manifolds being very general, we 
will also show how they can be carried over to homoge- 
neous spaces that do no longer form Lie groups them- 
selves. Standard examples are coset spaces G/H, i.e. the 
quotient of a Lie group G by a closed (yet not necessarily 
normal) subgroup H. In particular, naturally reductive 
homogeneous space are in the focus of interest. The well- 
known double-bracket flows will be demonstrated to form 
a special case precisely of this kind. 

Though gradient flows on the set of control amplitudes 
can be seen as another instance of flows on Riemannian 
manifolds, our paper does not primarily focus on optimal 
control. The goal is rather to give a comprehensive ac- 
count of the foundations of gradient flows — and thus the 
justification for some recent developments — as well as to 
present new flows for intrinsic or extrinsic constraints and 
new schemes of flows on reductive homogeneous spaces. 
Terms are kept general enough to trigger future devel- 
opments, since we elucidate the necessary requirements 
for implementing gradient-based optimisation methods in 
different geometric settings: Riemannian manifolds and 
submanifolds, Lie groups and homogeneous spaces. 

A separate paper on open quantum systems (58j sets 
up a formal approach within the framework of Lie semi- 
groups accounting for Markovian quantum evolutions (or 
Markovian channels). There we also show the current 
limits of abstract optimisation over reachable sets specif- 
ically arising in open systems. The differential geometry 
of the set of all completely positive, trace-preserving in- 
vertible maps is analysed in the framework of Lie semi- 
groups. In particular, the set of all Kossakowski-Lindblad 
generators is retrieved as its tangent cone (Lie wedge). 
Moreover, it shows how the Lie-semigroup structure cor- 
responds to the Markov properties recently studied in 
terms of divisibility [59] . It illustrates why abstract opti- 
misation tasks for open systems are much more intricate 
than in the case of closed system, while dynamic opti- 
mal control tasks for open systems can be handled com- 
pletely analogously. It specifies algebraic conditions for 
time-optimal controls to be the method of choice in open 
systems. Finally it draws perspectives to new algorith- 
mic approaches on semigroup orbits combining (abstract) 
knowledge of the respective Lie wedges with elements of 
numerical optimal control. 



Outline 

To begin with, we consider flows on (Ricmannian) 
manifolds and recall some basic terminology on dynami- 
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cal systems and Riemannian geometry. Then the aim is 
to provide the differential geometric tools for setting up 
gradient flows in different scenarios ranging from optimi- 
sation over the entire unitary group to subgroups (e.g. of 
local actions) or homogeneous spaces. Finally we give a 
number of applications including worked examples. 

More precisely, the paper is organised as follows: Sec- 
tion [IT] draws a general sketch of dynamical systems 
and flows on manifolds including issues of reachability 
and controllability. It provides the manifold setting for 
gradient-flow-based algorithms like steepest ascent, con- 
jugate gradients, Jacobi-typc, and Newton methods. 

A detailed analysis is then given in Section [1111 where 
(1) we resume the general preconditions for gradient flows 
on smooth manifolds. In particular, we recall the role of a 
Riemannian metric that allows for identifying the cotan- 
gent bundle T*M with the tangent bundle TM. Large 
parts of the foundations can be found in [U [24], [(3(| , but 
here we additionally provide a comprehensive overview of 
the interplay between Riemannian geometry, Lie groups, 
and (reductive) homogeneous spaces. (2) We give ex- 
amples of gradient flows on compact Lie groups as well 
as their closed subgroups. (3) In view of further de- 
velopments, we address gradient flows on several types 
of reductive homogeneous spaces: Cartan-like, naturally 
reductive ones and merely reductive ones. In particu- 
lar, double-bracket flows turn out as gradient flows on 
naturally reductive homogeneous spaces. (4) Examples 
interdispersed in the main text illustrate the relevance in 
a plethora of different settings. 

Section HVl is dedicated to specific applications in quan- 
tum control and quantum information. (1) We show 
how gradient flows on the subgroup of local unitaries 
SU\ oc (2 n ) in n qubits do not only provide a valuable 
tool in witness optimisation, but relate to generalised 
singular- value decompositions (svd), namely the tensor- 
SVD. Here, our gradient flows yield an alternative to com- 
mon algorithms for best rank-1 approximations of higher- 
order tensors, e.g. higher-order power methods (hopm) 
or higher-order orthogonal iteration (hooi). (2) Flows 
on SUi oc (2 n ) also serve as a convenient tool to decide 
whether Hamiltonian interactions can be time-reversed 
solely by local unitary manipulations thus complement- 
ing the algebraic assessment given in [28[. (3) Optimisa- 
tion tasks with (i) intrinsic and (ii) extrinsic constraints 
are addressed by tailored gradient flows on the respective 
subgroups (i) or with Lagrange- type penalties (ii). By 
including practical applications and worked examples we 
illustrate the ample range of problems to which gradient 
flows on manifolds provide valuable solutions. — To this 
end, we start out by an extended overview on techniques 
on (Riemannian) manifolds with particular emphasis on 
gradient techniques. 



II. OVERVIEW 

Flows and Dynamical Systems 

In this paper, we treat various optimisation tasks for 
quantum dynamical systems in a common framework, 
namely by gradient flows on smooth manifolds. Let M 
denote a smooth manifold, e.g. the unitary orbit of all 
quantum states relating to an initial state Xq. By a 
continuous-time dynamical system or a flow one defines 
a smooth map 

$:lxM->M (1) 

such that for all states X £ M and times t, t G R one 
has 

Ho,x) = x 

$(r,$(t,X)) = $(i + t,X) . { ' 

Since these equations hold for any X £ M, one gets the 
operator identity 

$ r o $ t = $ t+r (3) 

for all t, t £ R, thus showing the flow acts as a one- 
parameter group, and for positive times t, t > as a 
one-parameter semigroup of diffeomorphisms on M. 

Gradient Flows for Optimisation 

Now, the general idea for optimising a scalar quality 
function on a smooth manifold M (which might cither 
arise naturally or from including smooth equality con- 
straints, vide infra) by dynamical systems is as follows: 
Let / : M — > ffi. be a smooth quality function on M. The 
differential of / : M -> K is a mapping Df : M — > T*M 
of the manifold to its cotangent bundle T*M, while the 
gradient vector field is a mapping grad / : M — > TM to 
its tangent bundle TM. So the gradient of / at X £ M, 
denoted grad/(X), is the vector in TxM uniquely de- 
termined by 

D/(X) •£= (grad/pO|£>x for all £ G T X M. (4) 

Here, the scalar product {-\-)x plays a central role: it al- 
lows for identifying T X M with T X M. The pair (M, (■]■)) 
is called a Riemannian manifold with Riemannian met- 
ric (-|-). In view of gradient flows, the convenience of 
Riemannian manifolds lies in the fact that by duality in 
particular the differential D / (X) of / at X can be iden- 
tified with a tangent vector of TxM. 

Then, the flow $ : M x M — ► M determined by the 
ordinary differential equation 

X = grad/(X) (5) 

is termed gradient flow. Formally, it is obtained by inte- 
grating Eqn. ([5]), i.e. 

$(t,X)=*(jk,$(0,X))=X(t), (6) 
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Figure 1: (Colour online) The Riemannian exponential exp x 
is a smooth map taking the tangent vector £ TxM at 
X e M to exp x (££) € M. By varying t € R, it yields the 
unique geodesic with initial value X £ M and 'initial velocity' 
t e T X M. 



where X(t) denotes the unique solution of Eqn. |(SJ) with 
initial value X(0) = X. Observe this ensures that / does 
increase along trajectories $ of the system by virtue of 
following the gradient direction of /. 



Discretised Gradient Flows 

Gradient flows may be envisaged as natural continuous 
versions of the steepest ascent method for optimising a 
real-valued function / : R m — > R by moving along its 
gradient grad/ £ R m , i.e. 

Steepest Ascent Method 

X k+1 = X k + a k gmdf(X k ), (7) 

where a k > is an appropriate step size. 

Here, the right hand side of Eqn. ||7J) docs make sense, as 
the manifold M = R m coincides with its tangent space 
TxM = R m containing grad/(X). Clearly, a generali- 
sation is required as soon as M and TxM are no longer 
identifiable. This gap is filled by the Riemannian expo- 
nential map defined by 

exp x : T X M — ► M , £ h-> ex Px (£) (8) 

so that 1 1— > exp x (t£) describes the unique geodesic with 
initial value X £ M and 'initial velocity' £ <G TxM as 
illustrated in Fig. [1] 

If the manifold M carries the structure of a matrix 
Lie group G, we may identify the tangent space element 
£ G TxG with {IX, where Vt is itself an element of the Lie 
algebra q, i.e. the tangent space at the unity element q = 
T\G. Moreover, if the Lie-group structure matches with 
the Riemannian framework in the sense that the metric 
is bi-invariant (as will be explained in more detail later) , 
then the Riemannian exponential of £ = flX can readily 
be calculated explicitly. This is done in three steps by 

(i) right translation with the inverse group element X , 

(ii) taking the conventional exponential map of the Lie 
algebra element Q, (iii) right translation with the group 



element X as summarised in the following diagram 
£ = nX G TxG CXPx ) e n X G G 

lax (9) 

n G ^ ► e n G G . 

Next, the gradient system (JSJ) will be integrated (to 
sufficient approximation) by a discrete scheme that can 
be seen as an intrinsic Euler step method. This can be 
performed by way of the Riemannian exponential map, 
which is to say straight line segments used in the standard 
method are replaced by geodesies on M. This leads to 
the following integration scheme which is well-defined on 
any Riemannian manifold M. 

1. Riemannian Gradient Method 

X k+1 : = cxp Xfc (a k grad/(X fc )) (10) 

where a k > denotes a step size appropriately se- 
lected to guarantee convergence, cf. Section Hill 

For matrix Lie groups G with bi-invariant metric, 
Eqn. I|10p simplifies to 

1'. Gradient Method on a Lie Group 

X k+1 : = cxp(a fc grad/(X fe ) X^)X k , (11) 

where exp :g-*G denotes the conventional expo- 
nential map. 

In either case, the iterative procedure can be pictured as 
follows: at each point X k G M one evaluates grad/(X/j) 
in the tangent space Tx k M. Then one moves via the 
Riemannian exponential map in direction grad/(X^) to 
the next point X k+ \ on the manifold so that the qual- 
ity function / improves, f(X k+ i) > f(X k ), as shown in 

Fig.m 

The steepest ascent approach just outlined is most 
basic for addressing abstract optimisation tasks intrin- 
sically. Other intrinsic iterative schemes exploiting the 
underlying Riemannian geometry like conjugate gradi- 
ents, Jacobi-type methods or Newton's method can be 
obtained similarly. For an introduction to these more 
advanced topics beyond the subsequent sketch see, e.g., 
Refs. [II II, [&3 ■ 

2. Conjugate Gradient Method 

X l + l := argmax/(cxp x; (tfl l k )) 

*>° (12) 

y0 vn 

!grad/(X£) for; = 

g rad/(xi) + 4n X)bl _ liX ,(n$ ! _ 1 ) 
for I = 1, . . . ,n — 1, 

where a l k is a real parameter and IIx,y(^) denotes 
the parallel transport of f2 along the geodesic from 
X to Y. 
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f(X) 




XeM 



T/ 



M 




Figure 2: (Colour online) Abstract optimisation task: the 
quality function / :M-»R,Xh f(X) (top trace) is driven 
into a (local) maximum by following the gradient flow X — 
grad/(X) on the manifold M (lower trace). 

3. Jacobi-Type Method 

X[ +1 := argmax/(exp x , (*n,(^i))) 

tGR V k ' (13) 

yQ vs 

where Oo, Oi, . . . , ft s -i are vector fields such that 
Q {X), fii(X), . . . , O s _i(X) span T X M for all X G 
M. The integer s is called sweep length. 

4. Newton's Method 

X k+1 := exp Xfc ( - (KessfiXk))' 1 grad/(X fc )) , (14) 
where Hcss/(AT) denotes the Hessian of / at X. 

Gradient-Based Methods for Optimal Control 

Up to now we have addressed optimisation tasks over 
state spaces forming abstract manifolds M — hence the 
term abstract optimisation task (aot). In this subsec- 
tion, we briefly describe how gradient methods like the 
one in Ref. [29j arise in the context of optimal control 
tasks (oct). These algorithms are of practical relevance, 
as in many applications, the entire state space M of a 
physical system (E) evolving under some internal dynam- 
ics and external controls is not accessible. 

Now the general optimal control task amounts to find- 
ing the time course of controls to achieve a maximum of 



a given quality functional /, which may depend on the 
time evolution of the controls and the state — so-called 
'running costs' — as well as on the terminal state. Here, 
we consider only quality functionals / which are deter- 
mined by the terminal state of (E), i.e. I is given by 
some smooth quality function / on the state space M. 
Note, however, that this is in general not a restriction 
as running costs can often be reduced to an end point 
condition, see below. 

Thus, the optimal control task reduces to finding con- 
trols that drive the initial state Xq to a maximum of 

/|Reach(X )> (15) 

i.e. to a maximum of / restricted to what is known as the 
reachable set Reach(Xo) of X$. It is the set of all possible 
states the system can be driven into within positive time. 
Moreover, the set of all states which can be reached just 
in time T will be denoted by Reach(Xo , T) . Thus 

Reach(A ) := (J Rcach(X ,T). (16) 

T>0 

A more detailed discussion on this topic will be given in 
the next subsection. 

Choosing an (equidistant) sufficiently small partition- 
ing = to, . . . , t rip = T of the interval [0, T] and assum- 
img that the set of all states which can be reached in 
time T > by piecewise constant controls is dense in 
Reach(Xo,T), the optimal control task translates into a 
maximisation over the set 

M cp :=Step(t ,...,i„ p ;R^) (17) 

of all piecewise constant controls on [0, T] with fixed 
switching points ti, . . . , t n \. It is important to note 
that M cp is a vector space isomorphic to IR rac ™p, where 
n c and n p arc the respective numbers of controls and 
subintervals. 

In order to express the effect of a piecewise constant 
control on the final state of (S), we define a mapping 

4> T : M cp M, Y ^ X(T, Y), (18) 

where X{t,Y) = X(t, Y\, . . . , Y np ) denotes the unique 
solution of the control system (E) with initial value 
X(0,Y) = Xo and piecewise constant control Yj £ MJ 1 " 
on [tj—i,tj]. Hence, the impact of the controls on the 
quality function is described by the composition 

g T := / o T : M cp JR . (19) 

Often, the map 4>t ■ M cp — > M is 'almost' surjective in 
the sense that the interior of its image <j>r(M cp ) relative 
to Reach(X , T) is open and dense. Therefore, 4>t can be 
interpreted as a highly non-regular over-parametrisation 
of the set Rcach(ATo, T). Now, any standard non-linear 
optimisation tool, e.g. an Euclidian gradient method al- 
lows to compute the (local) maximum of gx and thus to 



7 



g T m = f° + T (Y) 




T/ 



X eM 




(j) T 



M cp ^ R^'"* 




Figure 3: (Colour online) Optimal control task: the quality 
function / : M — > R, X i— > /(X) is driven into a (local) max- 
imum on the reachable set Reach(Xo) C M by following an 
implicit procedure (intermediate panel). It is brought about 
by a gradient flow Y = grad gr(Y) on the level of experi- 
mental control amplitudes Y € M cp (lower traces) so that a 
standard steepest ascent method applies. 

numerically evaluate the overall quality function g de- 
fined by 

9{T) ■■= max ( max 5t , (F) J , 

where Y varies over all piecewise constant controls in 
Step(* , ■ ■ ■ ,t n ' p ,R n ") with t = and t n > p = T' . Hence, 
for solving the OCT, at last one has to increment T > 
until g(T) remains constant. Moreover, time-optimal 
solutions of the OCT are obtained while searching for the 



minimal time T* > such that g(T*) is equal to the 
maximum of g. 

In order to be more explicit, assume that the system 
(E) is given by a time-continuous control system, i.e. by 
an (ordinary) differential equation 

(E) X = F(X,u), ueR m (20) 

depending on some (unrestricted) control parameters u £ 
K m . Thus the general OCT amounts to maximising a 
functional 

l(X,u,T):=f(X(T))+ [ h(X(t),u(t))dt (21) 
Jo 

under the constraint Eqn. (|20p . Here, the integral term 
in Eqn. (|2"Tj) represents the so-called 'running costs'. We 
therefore suppose that h vanishes. This, however, is not 
really a restriction as mentioned above. By augment- 
ing an auxiliary state variable and state equation to (E) 
one can reduce 'running costs' to a terminal state con- 
dition, cf. [gjj Hi)]. Moreover, note that in the setting 
of time-continuous control systems the final scanning for 
the optimal value of g can be accomplished by introduc- 
ing another auxiliary optimisation parameter uo > in 
(E) such that 

(E ) X=u F[X,u), K,«)eM m+1 (22) 

while keeping the time interval [0, T] fixed, e.g. T = 1. 
Further numerical methods from optimal control can be 
found in e.g. [H, M, M, M, HI, US- Some of these 
more involved techniques require Pontryagin's maximum 
principle, which can be viewed as Lagrange-multiplicr 
method for constrained variational problems. These 
techniques often come at the cost of additional boundary- 
value problems. 

To sum up, the quality function /|R e ach(x ) is driven 
into a (local) maximum by an implicit procedure in the 
sense that it is not explicitly defined on the reachable set 
Reach(Xo) C M. Rather it is the result of a gradient 
flow 

Y = grad. 9T (y). (23) 

on the level of control amplitudes Y £ M cp = M n p ™ c so 
that one finally gets the discretised version 

Y k+1 =Y k + a k gmdg T {Y). (24) 

Thus the iterative scheme (j2"4")l reads as a standard steep- 
est ascent method. These ideas are illustrated in Fig. [31 
they are exploited in numerical optimal-control methods 
like the GRAPE algorithm [29| . 

Finally, note that the OCT relates to an AOT if the 
reachable set Reach(Xo) itself has the geometric struc- 
ture of a submanifold N of the state-space manifold M. 
However, in general this is not the case and would have 
to be ensured a priori. Therefore, if one seeks to re- 
duce the OCT to an AOT, geometric control and reach- 
ability aspects come into play as discussed in the next 
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subsection. For instance, in the Hamiltonian unitary 
evolution of a finite dimensional closed quantum sys- 
tem, the closure of the reachable set takes always the 
form of a group orbit of some initial state [70(, e.g. 
O(po) := {Upoll^ | U unitary}. Its Riemannian ge- 
ometry is well understood. In open dissipative systems, 
however, the dynamics is governed, e.g., by a Markovian 
quantum Master equation and thus by a semigroup of 
completely positive operators. As will be illustrated in 
Ref. (58| , it is much harder to give an explicit characteri- 
sation of their reachable sets. Apart from utterly simple 
scenarios, an abstract approach is often unviable in dis- 
sipative systems, and thus implicit methods by optimal 
control techniques may become indispensible. 

Reachability and Controllability 

In the following, some general remarks on reachable 
sets and controllability will clarify the previous, slightly 
sloppy introduction of these term. For simplicity, let (S) 
denote a smooth control system on the manifold M, i.e. 
a familiy of (ordinary) differential equations 

(E) X = F(X,u), ueUcR" 1 (25) 

with control parameters u € U and smooth vector fields 
F u := F(-,u) on AI. While the vector fields F u are as- 
sumed to be time- independent, the controls are allowed 
to vary in time. For convenience, the resulting control 
function t i— > u(t) € U is denoted again by u. Moreover, 
the set U of all admissible controls functions is supposed 
to contain at least all piecewise constant ones. 

For u £ U, we refer to X(t, Xq, u) as the unique solu- 
tion of (|25[) with initial value Xq. Therewith, the reach- 
able set of Xq is defined by 

Reachpfo) := |J Reach(A , T). (26) 

0<T 

Here Rcach(Ao, T) denotes the set of all states which can 
be reached in time T, i.e. 

Reach(Xo,T) := {X(T,X a ,u) e M\ueU), (27) 

cf. 1|16|) . The system (E) is said to be controllable, if 
Reach(ATo) = M for all Xq <G M, i.e. if for each pair 
(Xq,Yq) there exists an admissible control u and a time 
T > such that X(T, X , u) = Y . 

In general, it is hard to decide whether a given sys- 
tem (E) is controllable or not. However, for dynamics 
expressed on some Lie group G, the situation is much 
easier. Let (Eg) be a bilinear or, cquivalcntly, a right 
invariant, control- affine system on a matrix Lie group G 
with Lie algebra g, i.e. 

m 

(E G ) X = {A +^2u j A j )X, u&R m (28) 



with drift Aq £ g and control directions Aj £ g. For 
compact Lie groups G, a simple algebraic test for con- 
trollability is known: If the system Lie algebra 

s:=(A ,...,A m ) Ue (29) 

generated by Aq , . . . , A m via nested commutators coin- 
cides with g, then the corresponding system (Eg) is con- 
trollable, cf. [tTI. |72|. In particular, there exists a finite 
time T' > 0, such that the entire group G can be reached 
from any initial point Xq G G within this time, i.e. 

G= |J Reach(X ,T)=:Reach(Xo,<T') (30) 

0<T<T' 

for all Xq e G. 

In contrast, for non- compact groups G, which are 
indispensible for the description of dissipation in open 
quantum systems, the situation gets more involved. 
Here, s = g implies only accessibility of (Eg), he. that all 
reachable sets Reach(ATo) have non-empty interior. This 
follows from a more general result on smooth non-linear 
control systems — the so-called Lie algebra rank condition 
(larc) 

{F(X) \Fe(F u \ue U)uc} = T X M, (31) 

where (F u | u S U)^ denotes the Lie subalgebra of vec- 
tors fields generated by F u , u g U via Lie bracket op- 
eration, cf. [H, [ill . Note that for right invariant vector 
fields on G, the Lie bracket essentially coincides with the 
commutator such that |3T|) boils down to s = g. More- 
over, by exploiting the identity 

Reach(l, T t ) ■ Reach(l, T 2 ) = Reach(l, T x + T 2 ), (32) 

one can show that Reach(l) is always a Lie subsemigroup 
of G [58| . Here, a subsemigroup is a subset S C G which 
contains the unity and is closed under multiplication, i.e. 
1 £ S and S • S C S. However, the geometry of sub- 
semigroups is rather subtle compared to Lie subgroups 
and therefore at present not amenable to intrinsic opti- 
misation methods, as will be shown in more detail in a 
separate paper dwelling on open systems [58| . 

Settings of Interest 

In terms of reachability, there are different scenar- 
ios that structure the subsequent line of thought: we 
start out with fully controllable or operator controllable 
quantum systems [13, [zl Iz3, Izl, El, S Iz3 repre- 
sented as spin- or pseudo-spin systems. Then, neglect- 
ing decoherence, to any initial state represented by its 
density operator A, the entire unitary orbit O(A) := 
{UAU' 1 1 U unitary} can be reached [Z2}. In systems 
of n qubits (e.g. spin-i particles), this is the case un- 
der the following mild conditions [26|, [13, [lH: (1) the 
qubits have to be incquivalent, i.e., distinguishable and 
selectively addressable, and (2) they have to be pairwise 
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coupled (e.g., by Ising or Heisenberg-type interactions), 
where the coupling topology may take the form of any 
connected graph. In other instances not the entire (uni- 
tary) group, but just a subgroup K can be reached. This 
is the case, if the coupling topology is not a connected 
graph or the n qubits cannot be addressed by separate 
controls. 

Otherwise, the system itself can be fully controllable, 
but the the focus of interest may be reduced: e.g., the 
subgroup K = SC/ioc(2") := SU(2) ® SU{2) <g> • ■ -(8)517(2) 
of (possibly fast) local actions on each qubit is of inter- 
est to study local reachability, or whether an effective 
multi-qubit interaction Hamiltonian is locally reversible 
in the sense of Halm's spin echo f28j. Or, one may ask 
what is the Euclidean distance of some pure state to the 
nearest point on the local unitary orbit of a pure product 
state. This may be useful when optimising entanglement 



witnesses 



54j. Likewise, one may address other than 
the finest partitioning of the entire unitary group, e.g. 
K = SU(2 ni )®---®SU(2 n i)®---®SU(2 n '-) c SU(2 n ), 

where 2j=i n j — n - 

Another type of reduction arises not by restriction to 
a subgroup H, but by the fact that the quality func- 
tion of interest / is equivariant, i.e. constant on cosets 
HG. Consider, for instance, a fully controllable system 
where / is equivariant with respect to the closed sub- 
group H C G. Then, it may be favourable to transfer 
the optimisation problem from the original Lie group G 
to the homogeneous space G/H. 



III. THEORY: GRADIENT FLOWS 

Gradient systems are a standard tool of Riemannian 
optimisation for maximizing smooth quality functions 
on a manifold M. Thus the manifold structure of 
M arises either naturally by the problem itself or by 
smooth equality-constraints imposed on a previously un- 
constrained problem. Note that in general inequality- 
constraints would entail manifolds with a boundary — and 
thus are a much more subtle issue not to be developed 
any further here. 

The case M = M™ — sometimes refered to as the uncon- 
strained case — is well-known and can be found in many 
texts on ordinary differential equations or nonlinear pro- 
gramming, cf. [?9|, Hil [HI, H2|- However, gradient sys- 
tems on abstract Riemannian manifolds provide a rather 
new approach to constrained optimisation problems. Al- 
though the resulting numerical algorithms are in gen- 
eral only linearly convergent, their global behaviour is 
often much better then the global behaviour of locally 
quadratic methods. 

Textbooks combining the different areas of Rieman- 
nian geometry, gradient systems and constrained opti- 
misation are quite rare. The best choices to our knowl- 
edge are f2ll loOll. For further reading we also suggest 



misation approach applied in Section IIVI is scattered in 
many different references. For the reader's convenience, 
we therefore review the basic ideas on these topics. First, 
we discuss the general setting on Riemannian manifolds, 
then we proceed with Lie groups and finally summarize 
some more advanced results on homogeneous spaces. For 
standard definitions and terminology from Riemannian 
geometr y w e refer to any modern text on this subject 
such as 11,113, Hi- 



A. Gradient Flows on Riemannian Manifolds 

In the following, let M denote a finite dimensional 
smooth manifold M with tangent and cotangent bun- 
dles TM and T*M, respectively. Moreover, let M be 
equipped with a Riemannian metric (■{■), i.e. with a 
scalar product (-\-)x on each tangent space TxM vary- 
ing smoothly with X G M. More precisely, (-|-) has to 
be a smooth, positive definite section in the bundle of all 
symmetric bilinear forms over M. Such sections always 
exist for finite dimensional smooth manifolds, cf. [861 ] - 
The pair (M, (-|-)) is called a Riemannian manifold. 

Let / : M — > M be a smooth quality function on M 
with differential Df : M -> T*M. Then the gradient 
of / at X € M, denoted by grad/ (X), is the vector in 
TxM uniquely determined by the equation 



D (grad /POIf) 



x 



(33) 



for all £ € TxM. Equation (|33[) naturally defines a vector 
field on M via 

grad/ : M -> TM, X h-> grad / (X) (34) 

called the gradient vector field of /. The corresponding 
ordinary differential equation 



X = grad/(X), 



(35) 



and its flow are referred to as the gradient system and 
the gradient flow of /, respectively. 

Obviously, the critical points of / : M — > R co- 
incide with the equilibria of the gradient flow. More- 
over, the quality function / is monotonically increasing 
along solutions X(t) of (|3"5)) . i.e. the real- valued function 
1 1 ► f{X{t)) is monotonically increasing in t, as 



(1 
df 



f(X(t)) = (grad/(X(t))|X(i)) x(t) 



||grad/(X(t))|| 



X(t] 



> 0. 



the papers jialig, H3 . Nevertheless, most of the mate- Here 1 1 ■ | \ x den otes th e norm on T X M induced by 
rial which is necessary to understand the intrinsic opti- i.e. ||£||jt : = \J {£,\C)x for all £ G TxM. 



)x, 



10 



Convergence of Gradient Flows 



Restriction to Submanifolds 



Recall that the asymptotic behaviour (for t — > +00) of 
a solution of (|35[) is characterised by its co-limit set 

co(X ) := P| {X(t,X ) I t < t < t+(X )}, 

0<t<t+(X o ) 



where {•••} denotes the closure of the set {•••} and 
X{t, Xq) the unique solution of (|35[) with initial value 
X(0) = Xq and positive escape time t + (Xo) > 0. 

The following result gives a sufficient condition for so- 
lutions of Eqn. ([33)1 to converge to the set of critical 
points of /. 

Proposition III.l. If f has compact a superlevel set, 
i.e. if the sets {X £ M \ f(X) > C} are compact for all 
C 6 M, then any solution of Eqn. L'J5\) exists for t > 
and its co-limit set is a non-empty compact and connected 
subset of the set of critical points of f . 

Proof. Since solutions of Eqn. (13"5|) arc monotonically 
increasing in t, the compact sets {X £ M | f(X) > C} 
are positively invariant, i.e. invariant for t > under 
the gradient flow of Eqn. (|3"5|) . Thus, the assertion fol- 
lows from standard results on w-limit sets and Lyapunov 
theory, cf. dHHj. □ 



Although, Proposition IIII.ll guarantees that co(Xq) is 
contained in the set of critical points of /, this does not 
imply convergence to a critical point. Indeed, there are 
smooth gradient systems which exhibit solutions converg- 
ing only to the set of critical points, cf . [87[ ■ The next two 
results provide sufficient conditions for convergence to a 
single critical point under different settings. In particu- 
lar, Threorem IIII.ll yields a powerful tool for analysing 
real analytic gradient systems. 

Corollary III.l. If f has compact superlevel sets and if 
all critical points are isolated, then any solution of i35\) 
converges to a critical point of f for t — ► +00 . 

Proof. This is an immediate consequence of Propo- 
sition HilU □ 

Theorem III.l (Lojasiewicz). If (M, {•]•}) and f are 
real analytic, then all non-empty co-limit sets co(Xq) of 
Eqn. \35)) are singletons, i.e. co(Xq) 7^ implies that 
X(t,Xo) converges to a single critical point X* of f for 
t — > +00. 

Proof. The main argument is based on Lojasiewicz's 
inequality which says that in a neighbourhood of X* an 
estimate of the type 

\f(X)\v <C||grad/(X)|| 

for some p < 1 and C > holds. A complete proof can 
be found in HE HI. □ 



Now, consider the restriction of / to a smooth sub- 
manifold N C M. Obviously, the Ricmannian metric 
(•|-) on M restricts to a Ricmannian structure on N. 
Thus (iV, (- 1 •) |ttv) constitutes a Ricmannian manifold in 
a canonical way. Moreover, the equality D/|jv(X) = 
D/(X)|x x Af immediately implies that the gradient of 
the restriction /|jv at X € N is given by the orthogonal 
projection of grad/(AT) onto TxN, i.e. 



; rad/| Ar (A) = P x (grad/(A)), 



(36) 



where Px denotes the orthogonal projector onto TxN. 
Hence the gradient system of /|n on an arbitrary sub- 
manifold N is well-defined and reads 



A = J P x (grad/(A)). 



Analysing Critical Points by the Hessian 



(37) 



Subsequently, we address the problem, how to define 
and compute the Hessian of /, as its knowledge is essen- 
tial for a deeper insight of ([55)1 . For instance, the stability 
of critical points is determined by its eigenvalues or the 
computation of explicit discretisation schemes, preserv- 
ing the convergence behaviour of (|35[) . can be based on 

it, cf. [IIlH. 

At critical points X* € M of /, the Hessian is given 
by the symmetric bilinear form Hess/(A'*) : Tx'M x 
T X *M -> R, 

Hess/(X*)(£,7j) := (38) 
(D ^(A*)£) T Hess(.f o Lp^ 1 )( y Lp(X*)) D <p(X*)ri, 

where (p is any chart around X* and Hess(/ o tp^ 1 ) de- 
notes the ordinary Hesse matrix of foip^ 1 . It is straight- 
forward to show that (|38p is independent of <p. Equiva- 
lently, Hess/(A*) is uniquely determined by 



Hcss/(X*)(£,£) := 



d 2 (/oa) 



dt 2 



t=o 



(39) 



where a is any smooth curve with X* = a(0) and 
d(0) = £. While the remaining values of Hess/(A'*) can 
be obtained by a standard polarisation argument, i.e. via 
the formula 

2Hcss/(A*)(e, Tj) - Hcss/(X*)(£ + v , £ + rj) 

-Hess/(X*)(C,0 -Hcss/(A*) (77,77) . (40) 

However, the previous definition does not apply to reg- 
ular points of /. In general, one has to establish the 
concept of geodesies, cf. Remark IIII.ll More precisely, 
the Hessian of / at an arbitrary point x £ M is defined 

by 



Hess/pO(£,£) 



d 2 (/o 7 ) 



dt 2 



t=o 



(41) 
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where 7 is the unique geodesic with X = 7(0) and 7(0) = 
£, cf. |l37j |. Again, the remaining values can be computed 
by ([40)) . As usual, we associate to Hess/(X) a unique 
selfadjoint linear operator Hessf(X) : T^M — > T^M 
such that 

{Z\Hes8f(X) V ) x = Hess/(X)(£, 77) (42) 

holds for all £, 77 € TjM. It is called the Hessian operator 
of / at X e M. 

Remark III.l. In modern textbooks, geodesies are de- 
fined via linear connections on Af, cf. [85l l90j. For Rie- 
mannian manifolds AI, however, it is possible to intro- 
duce (Remannian) geodesies as curves of minimal arc 
length. Both concepts coincide (locally) if we choose the 
so-called Riemannian or Levi-Civita connection on M. 
Unfortunately, the computation of geodesies is in general 
a highly non-trivial problem. However, on compact Lie 
groups their calculation is much easier as we will see at 
the end of Section IIII Bl 

The above concepts yield the following generalisation 
of a familiar result from elementary calculus. 

Theorem III. 2. Let AI be a Riemannian manifold and 
let X* be a critical point of the quality function f : M — > 
K. //Hcss/(X*) or, equivalently, Hessf(X) are negative 
definite, then X* is a strict local maximum of f . 

Proof. Use local coordinates, then the result follows 
straightforwardly from Eqn. ([55)) . □ 

In general, (asymptotic) stability of an equilibrium 
X* 6 M of ()35[) may dependent on the Remannian met- 
ric (-|-). However, the property of being a strict local 
maximum or an isolated critical point of a smooth func- 
tion / is obviously not up to the choice of any Reman- 
nian metric. Therefore, the following result shows that 
in fact certain (asymptotically) stable equilibria X* € M 
of (|35p are independent of the Riemannian metric. 

Theorem III. 3. (a) If X* E AI is a strict local maxi- 
mum of f , then X* is a stable equilibrium of 135\) . 
In particular, for any neighbourhood U of X* there 
exists a neighbourhood V of X* such that the u>- 
limit sets u>(xq) are non-empty and contained in U 
for all xq E V. 

(b) If X* E M is a strict local maximum and an iso- 
lated critical point of f, then X* is an asymp- 
totically stable equilibrium of \35\) . In particu- 
lar, there is a neighbourhood V of X* such that 
u)(xq) = {A*} for all Xq E V, i.e. all solutions 
X(t,Xo) with initial value Xq E V converge to X* 
for t — > +00 . 

Proof. Both assertions follow immediately from clas- 
sical stability theory by taking / as Lyapunov function, 

cf. * □ 

Note that the convergence analysis near arbitrary equilib- 
ria, i.e. near arbitrary critical points of / is quite subtle 
and may depend on the Riemannian metric, cf. (oT| | . 



Discretised Gradient Flows 

Finally, we approach the problem of finding discreti- 
sations of (|35[) which lead to convergent gradient ascent 
methods. The ideas presented below can be traced back 
to R. Brockett, cf. |16| . Let 

cxp x : T X M -► M (43) 

be the Riemannian exponential map at X E M , i.e. 
t 1— > cxp x (/j£) denotes the unique geodesic with initial 
value X E AI and initial velocity £ E T X AI. Moreover, 
we assume that M is (geodcsically) complete, i.e. any 
geodesic is defined for all t € R. Hence, (|4"3")l is well- 
defined for the entire tangent bundle TM. 

The simplest discretisation approach — a scheme that 
can be seen as an intrinsic Euler step method — leads to 

Riemannian Gradient Method 

X k+1 : = cxp Xk (a k grad f{X k )) (44) 
where a k denotes an appropriate step size. 

In order to guarantee convergence of |44)) to the set of 
critical points, it is sufficient to apply the Armijo rule 
[82| . An alternative to Armijo's rule pro vides the step 
size selection suggested by Brockett in [18[ , see also [21J . 
Convergence to a single critical point is a more subtle 
issue. If (A/, (-|-)) and / are analytic, and the step sizes 
are chosen according to a version of the first Wolfe-Powell 
condition for Riemannian manifolds, then pointwise con- 
vergence holds. A detailed proof can be found in [92l |. 

B. Gradient Flows on Lie Groups 

In the following, we apply the previous results to Lie 
groups and Lie subgroups. However, to fully exploit Lie- 
theoretic tools, the Riemannian structure and the group 
structure have to match, i.e. the metric (-|-) has to be 
invariant under the group action. For basic concepts and 
results on Lie Groups and their Riemannian geometry we 
refer to [13, HI]. In particular, we recommend the book 
of Arvanitoyeorgos [94| for a rather condensed overview. 

Let G denote a finite dimensional Lie group, i.e. a 
group which carries a smooth manifold structure such 
that the group operations are smooth mappings. For 
notational convenience we will assume that G can be 
represented (closed) matrix Lie group, i.e. as an 

(embedded) Lie subgroup of some general linear group 
GL(N, K) of invcrtible Ax A-matrices over K = R or C. 

Remark III. 2. According to a well-known result by 
Cartan, a subgroup G C GL(N, K) is an (embedded) Lie 
subgroup, i.e. a smooth submanifold of GL(N, K), if and 
only if it is closed in GL(N,K), cf. [95|. Note, however, 
that there is a subtle difference between embedded and 
immersed Lie subgroups. Moreover, not every abstract 
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Lie group admits a faithful representation as a matrix 
Lie group. Nevertheless, the class of matrix Lie groups 
is rich enough for all of our subsequent applications. For 
more details on these topics we refer to [961 ] - 



Invariant Metrics 

A Lie group G can be endowed in a canonical way 
with a Riemannian metric (-|-). Let g := TiG be the Lie 
algebra of G, i.e. the tangent space of G at the unity 1. 
From the fact that the right multiplication th ■ G — > G 
and left multiplication Ijj : G — > G are diffcomorphisms 
of G for all H G G, it follows 

T H G = qH = Hq (45) 

for all H G G. Now, let (-|-) be any scalar product on g. 
Then 

(g\h) G := (gC'lhG- 1 ) (46) 

for all G G G and g, h G T G G yields a right invariant 
metric on G, where right invariance stands for 

(g\h) G = (gH\hH) GH (47) 

for all G,H G G and g, h G T G G. Thus, right multipli- 
cation ru represents an isometry of G. In the same way, 
one could obtain left invariant metrics on G. 

Remark III. 3. In an abstract setting, one has to replace 
63 by 

T H G = -Dr H (l) 5 = -Dl H (l) 5 (48) 

for all H G G, where D rn and D Ih denote the tan- 
gent maps of ru and Ir, respectively. For a matrix Lie 
group, however, the respective tangent maps are given 
by Dr H (G)£ = £H and DZ H (G)£ = H£ for all G G G 
and £ G T G G. Hence (gSJ) reduces to (|45|) . 

The construction of bi-invariant, i.e. right and left 
invariant metrics is much more subtle and in general even 
impossible. To summarise the basic results on this topic 
we need some further terminology. The adjoint maps 
Ad : G — > GL(g) and ad : g — ► gl(g) are defined by 

Ad G h := GhG^ 1 

and 

ad ff h := [g, h] := gh — hg 

for all G G G and all g,h G fl, where GL{g) and gl(fl) 
denote the set of all automorphisms and, respectively, 
endomorphisms of g. Note both notations &d g /i and [g, h] 
are used interchangeably in the literature. A bilinear 
form (-|-) on g is called 



(a) Adc-invariant if the identity 

(g\h) = (Ad G g\ Ad G h) (49) 
is satisfied for all g, h G g and G G G. 

(b) ad B - invariant if the identity 

(ad g /i|fc) = -(h\ad g k) (50) 

is satisfied for all g, h, k G g. 

Proposition III. 2. The following statements are equiv- 
alent: 

(a) There exists a bi-invariant Riemannian metric (•(•) 
on G. 

(b) There exists an Adc -invariant scalar product (-|-) 
on q. 

Moreover, each of the statements (a) and (b) imply 

(c) There exists an a,d g -invariant scalar product (-|-) on 
9- 

If G is also connected, then (c) is equivalent to (a) and 
(b), respectively, 

Proof. The equivalence (a) ^=> (b) follows easily by 
exploiting Eqn. (|4T|) at G = 1. Moreover, applying (b) 
to a one-parameter subgroup t h- > cxp(tg) and taking the 
derivative with respect to t = yields (c). The impli- 
cation (c) (b) is obtained in the same way, i.e. by 
taking the derivative of 

1 1 — ► ( Ad c tg h | Ad e t s fc) . 

with respect to t, cf. [94|. Note, however, that this im- 
plies AdG-invariance only on the connected component 
of the unity. Therefore, connectedness is necessary for 
the implication (c) (b) as counter-examples show. □ 

Now, the main result on the existence of bi-invariant met- 
rics reads as follows. 

Theorem III. 4. A connected Lie group G admits a bi- 
invariant Riemannian metric if and only if G is the direct 
product of a compact Lie group Go and an abelian one, 
which is isomorphic to some (R m , +), i.e. G = Go x K m 

Proof. Cf. 13, [93- □ 

Finally, we focus on a special class of Lie groups. 
A connected Lie group G is called semisimple if the 
Killing form, i.e. the bilinear form (<?, h) 1— * n(g, h) := 
tr(ad 9 adft) is non-degenerate on g. Most prominent 
representatives of this class are SL(N,M.), SL(N,C), 
SO(N,]&) and SU(N). More on semis imp le Lie groups 
and their algebras can be found in [93|, |95| . 

Theorem III. 5. (a) If G is semisimple then the 
Killing form n defines an &d g -invariant bilinear 
form on g. 

(b) If G is semisimple and compact then — n defines 
an &d g -invariant scalar product on g. Thus —k in- 
duces a bi-invariant Riemannian metric on G. 

Proof. Cf. [13,[9l. □ 
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Gradient Flows with Respect to an Invariant Metric 

Next, we study gradient flows on G or on a closed 
subgroup H C G with respect to an invariant metric (-|-). 
Therefore, let / : G — > R be a smooth quality function 
and let ip : G — > G be any diffeomorphism. Using the 
identity 

grad(/o^)(G) = (D^(G))*grad/(vKG9) (51) 

for all G £ G, where (■)* denotes the adjoint operator, 
we obtain by the right invariancc of the metric 

grad/(G)=grad(/or G )(l)G (52) 

for all G £ G. Hence 

G = grad/(G). (53) 

can be rewritten as 

G = grad(/or G )(l)G. (54) 

Thus, the gradient flow of / is completely determined 
by the mapping G i— ► grad(/ o r G )(l) £ 0- To study 
its asymptotic behaviour of Eqn. (|53|) we can apply the 
results of the previous section. For instance, for compact 
Lie grops we have. 

Corollary III. 2. Let G be a compact Lie group with a 
right invariant Riemannian metric (-|-) and let f : G — > 
R be a real analytic quality function. Then any solution 
of Eqn. &53\) converges to a critical point of f for t — > 
+oo. 

Proof. This follows immediately from Proposition 
IIII. II and Theorem lIII.ll as the pair (G, (-|-)) constitutes 
a real analytic Riemannian manifold whenever the metric 
(■|-) is invariant, cf. (98[. □ 

Now, let H be a closed subgroup of G. By Remark 
IIII. 21 we know that H is actually an (embedded) sub- 
manifold of G. Therefore, the gradient flow of /|h with 
respect to is well-defined and can be given explic- 

itly via the orthogonal projectors Pjj, cf. (|37p. However, 
for an invariant metric the computation of Ph simplifies 
considerably, as all calculations can be carried out on the 
Lie algebra g of G. 

Lemma III.l. Let G be a Lie group with a right invari- 
ant Riemannian metric (-|-) and let H be a closed sub- 
group ofG. Furthermore, let g and t) their corresponding 
Lie algebras and denote by P^ the orthogonal projection 
of Q onto t) . Then the orthogonal projection Ph in h3b\) 
is given by 

P H {gH) := P h {g)H (55) 

for all gH £T H G. 

Proof. This is a straightforward consequence of the 
identity T#H = t)H and the right invariancc of (• | •) . □ 

According to ([37]). (|52j) and ([gg]). the gradient flow of f\ H 
finally reads 

H = P„(grad(/o m )(!))#. (56) 



Geodesies with Respect to an Invariant Metric 

The remainder of this subsection is devoted to the is- 
sue: How to compute geodesies and the Hessian of a 
smooth quality function with respect to an invariant met- 
ric. The main results for the forthcoming applications are 
summarised in Theorem IIII.6f b) and Proposition IIII. 31 
For readers with basic differential geometric background 
we provide some details of the proof which however can 
be skipped, so as not to lose the thread. First, we need 
some further notation. Let 

X r g : G h+ gG and X l g : G ^ Gg (57) 

be the right and left invariant vector fields on G which 
arc uniquely determined by X g (T) = g and X l g (T) = g, 
respectively. Moreover, let L x (-) denote the Lie deriva- 
tive with respect to the vector field X, i.e. for a smooth 
function / : G — > R one has 

L x (f)(G) :=Df(G)-X(G). 

On vector fields y, the action of L x (-) is given by 

L x {y){G) := 

r (D^G))- 1 -y(^ x (t,G))-y(G) 

— lim 1 , 

t — ^oo t 

where &x{t, ■) denotes the corresponding flow of X. 

Next, we recall two basic facts from differential geom- 
etry which play a key role for the proof of Theorem IIII. 6 1 
The first one shows that the set of right/left invariant 
vector fields is invariant under Lie derivation, cf. [93|, [94| . 
The second one relates a Riemannian metric of a man- 
ifold M with a particular linear connection on M. For 
more details see e.g. [H|. 

Fact 1. The Lie derivative of a right /left invariant vector 
field is again right/left invariant and satisfies 

L x^h = - X [ g ,h] and LxiXh = x [ g ,hY ( 58 ) 

Fact 2. On any Riemannian manifold M there exists 
a unique Riemannian connection V determined by the 
properties 

L x y = v x y-v y x (59) 

and 

v x {y\z) = (v x y\z) + (y\v x z). (60) 

Now, combining both facts yields the main result about 
geodesies on Lie groups. 

Theorem III. 6. Let G be a Lie group with a bi- 
invariant metric (-|-) and let V denote the unique Rie- 
mannian connection on G induced by (-|-). 

(a) For right/left invariant vector fields the Rieman- 
nian connection V is given by 

^xiK = -\x{ gM ™d V xl X l h = l -X[ gM . (61) 
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(b ) The geodesies through any G £ G are of the form 
1 1— ► Gexp(tg) or t ^ exp(tg)G with g € g. In par- 
ticular, the geodesies through the unity t are pre- 
cisely the one-parameter subgroups ofG. 

Proof, (a) Applying Koszul's identity, cf. [Mill, 

2{v x y\z) = L x {y\z) + L y {z\x)-L z {x\y) 
-{x\L y z) + (y\L z x) + (z\L x y), 

to X g , X£ and XT we obtain 
2{V x; X£\Xj;) = 

+ ( X l \ X \h,k}) - { X h\ X [k,g}) ~ ( X k\ X [g,h])- 

Now Proposition IIII.2I and Fact Q] implies 



2{V X rXl\Xl) = -(X r k \X[ gM ) 



and hence 



~2 \sM- 



Obviously, for left invariant vector fields the same argu- 
ments apply. 

(b) Let 7(t) := exp(tg)G. Part (a) implies that the co- 
variant derivative V^( t )7(i) = V x^ X g(^(t)) of 7 van- 
ishes and thus 7 represents the unique geodesies through 
G with 'initial velocity' £ = qG. The same holds for 
7(i) := Gexp(i 5 ), cf. H or □ 

Observe that the bi-invariance of the metric and the in- 
variance of the vector fields are essential for the above 
result. For example Eqn. (|61[) fails, if the Riemannian 
metric is just right invariant. More details on this topic 
can be found in [H, Hi] 

Finally, by Theorem lIII.6[ the Hessian of the restriction 
/|h can easily be obtained by restricting the Hessian of 
/ to TH. More precisely, we have. 

Proposition III. 3. Let f : G — > ffi be a smooth quality 
function on a Lie group with bi-invariant metric (-|-) and 
let H be a closed subgroup. Then the Hessian of f\n at 
H is given by 



Hess/| H (# ) = Hess/t-Ef) 



TffHxTff H 



(62) 



Note that in general Eqn. (f6"2"| is sheer nonsense unless 
H is a Lie subgroup. Counter-examples can be obtained 
easily for G = R m . 



C. Gradient Flows on Homogeneous Spaces 

The subsequent section on homogeneous spaces is mo- 
tivated by the following observation, cf. Subsection lHI Dl 
As before, let / : G — > R be a smooth quality function. 



In many applications / can be decomposed into a func- 
tion F defined on a smooth manifold M and a (right) 
group action a : (X, G) 1— ► X ■ G on M such that 



f(G) := F(X ■ G) 



(63) 



for some fixed IeM. Then we can think of / as defined 



on the orbit of X. More precisely, let f — F 



0{X) 



where 



0(X) := {X ■ G I G G G} denotes the orbit of X. Thus, 



f(Y) = f(G) 



(64) 



for Y = X ■ G with G E G. Such quality functions 
/ are called induced by F, cf. Subsection IIII Dl By 
construction, we have 



max f(G) = max f(Y). 
ggg ' reo(x) y ' 



(65) 



Moreover, let H x ■= {G e G | X ■ G = X} denote the 
stabiliser or, equivalently, isotropy subgroup of X. Then 
/ can also be viewed as a function on the right coset space 



G/H A - := {H X G I G g G}, 



(66) 



cf. [13q . which is equivalent to say that / is equivariant 
with respect to Hx, i.e. 



/(G) = f(HG) 



(67) 



for all H G Ha. Therefore, coset space show up quite 
naturally in optimising equivariant quality functions. 
Note that passing from G to G /H x can be rather useful 
in order to avoid certain degeneracies such as continua of 
critical points. 



Coset Spaces 

We first collect the fundamental facts on the differ- 
ential structure of G/H, where H is any closed sub- 
group of G. Detai led expositions can be found in 

I93LI94I. [Tool. [Toil, fiol. 



Theorem III. 7. Let G be a Lie group with Lie algebra 
q and let H C G be a closed subgroup with Lie algebra f). 
Moreover, let p be any complementary subspace to f), i.e. 
q = t) © p. Then the following holds: 

(a) The quotient topology turns the set of right cosets 
G/H := {[G] := HG GeG} into a locally com- 
pact Hausdorff space. 

(b) There exists a unique manifold structure on G/H 
such that the canonical projection H : G — > G/H, 
G 1—* [G] is a submersion. In particular, the tan- 
gent space o/G/H at [1] is isomorphic to p via the 
canonical identification p 1— > ^ [exp tp] 1 1 „ and thus 
dimG/H = dimG - dimH. 

The following statements refer to the unique manifold 
structure on G/H given in part (b). 
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(c) The Lie group G acts smoothly from the right on 
G/H via 



([G'],G) i — ► [G'G] 



(68) 



such that 

? G ■ G/H -► G/H, [G'] i > [G'G] (69) 

are diffeomorphisms for all G £ G. Moreover, 

IIoI G : G -> G/H, G' i-> [GG'] (70) 

are submersions for all G' € G. Thus, the tangent 
space T[g]G/H is given by 

T [G] G/H = Df G ([l])T [1] G/H 

= D(no/ G )(l) fl (71) 
= D(no/ G )(l)(Ad G - iP ). 

(d) Moreover, if H is a normal subgroup, i.e. 
GHG -1 = H for all G € G, then the multipli- 
cation [G] ■ [G'] := [GG'] is well-defined and yields 
a Lie group structure on G/H. 



Proof. Cf. m 



□ 



The Lie group G/H given by Theorem IIII.7I (d) is called 
the quotient Lie group of G by H. Moreover, the result 
provides the possibility to extend the well-known First 
Isomorphism Law to the category of Lie groups. 

Theorem III. 8. Let $ : G — > G' be a smooth sur- 
jective Lie group homomorphism. Then there exists a 
well-defined Lie group isomorphism $ : G/H — * G' with 
H := ker$ such that the diagram 



(72) 



commutes. Moreover, let g, q' and t) denote the cor- 
responding Lie algebras and let p be any complementary 
space to f). Then D <j>(l) is a surjective Lie algebra homo- 
morphism with ker D <&(]!) = f) and commutative diagram 




D*(l 



Dn(i) 




(73) 



P = fl/f) • 

Proof. Note that H = ker $ is a closed normal 
subgroup of G. Thus, by the First Isomorphism Law 
$([G]) := $(G) for [G] G G/H is a well-defined group 
isomorphism. Moreover, $ is smooth, since LI is a smooth 
submersion by Theorem IIII. 71 The assertion that D <&(!) 
is a surjective Lie algebra homomorphism, follows easily 
from the properties of the exponential map. Finally, a 
straightforward application of the chain rule yields Eqn. 

H73). □ 



Orbit Theorems and Homogeneous Spaces 

Next, we analyse the relation between group actions 
and coset spaces. A smooth right Lie group action is a 
smooth map a : M X G -> M, {X,G) h-> X ■ G which 
satisfies 

(X ■ G) ■ H = X ■ (GH) and X -1 = X 

for all X G M and G, H G G. The orbit of X G M under 
the group action a is defined by 0(A) := {A ■ G | G € 
G}. The action is called transitive if M — 0{X) for 
some and hence for all X G M. Equivalently, one can 
say that for all X,Y G M there exists an element G £ G 
with y = X ■ G. Moreover, for X e M let H x := 
{G E G \ X ■ G ~ X} denote the stabiliser of X and 
cex : G — > M the map G i— > X • G. Then the canonical 
map ax ■ G/Hx — > M is defined by [G] i— ► X ■ G. 

Theorem III. 9 (Orbit Theorem). Let G be a Lie group 
with Lie algebra q and let a : M x G — * M be a smooth 
right action of G on a smooth manifold M . Moreover, 
let X be any point in M . Then the following statements 
are satisfied: 

(a) The stabiliser subgroup Hx is a closed subgroup of 
G. 



(b) Let t)x be the Lie algebra ofHx- Then 

kerD ax(t) — \)x ■ 

In particular, the canonical map ax ■ G/Hx 
is an infective immersion. 



(74) 
-> M 



(c) The canonical map ax is an embedding, i.e. 0(X) 
is a submanifold of M diffeom orph ic to G/Hy, if 
and only if ax *s proper, cf. \139l L In this case, 
the tangent space of O(X) atY^X-G is given by 



TyO(X) = D ax(G) T G G 
= D«y(l)fl 
= Day(l) Ad G -i p x , 



(75) 



where px is any complementary subspace of l)x> 
i.e. g = t)x®Px- 

Proof, (a) The continuity of ax implies that Hx = 
a x 1 (X) is closed. 

(b) In order to see that ax is an injective immersion, 
consider the identity ax ° r G = a(ax{-), G) and thus 

D ax (G) ■ <?G = Di a(X, G)oBa x (l)g. 

Therefore, D ax(l) <? = implies 

^_a x (exp(tg)) = 

for all t G M and hence kerDax(l) C \)x- As 
the inclusion \)x C kerDajf(l) is obvious, we obtain 
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kerDajf(l) = \)x- Moreover, let px be any comple- 
mentary subspace of f)x- Then, identifying px with 
T[!]G/Hjf yields Da x ([t\) = Da x (l)| p , cf. Theorem 

IIII. 71 Thus, is injective and the same holds for 

any other [G] G G/Hx by right multiplication rc- 

(c) The first part follows from a standard embedding 
criterion on immersed manifolds, cf. (86J. The first equal- 
ity of Eqn. 1|75|) is a straightforward consequence of the 
identity ax = Sx ITy, where ITx : G — ► G/Hj denotes 
the canonical projection. The second one is obtained by 
ay = c(x°Ig = ttjor G o Ad G , while the third one follows 
from Hy = Ad G -i H x . For further details see also [2l| . 
□ 

Corollary III. 3. Let a : M x G — > M be as in Theorem 
MIL ffl and let X G M be any point. 

(a) If G is compact then G/Hx is diffeomorphic to 
0{X). 

(b) If a is transitive then G/Hj is diffeomorphic to 
M. 

Proof, (a) This follows immediately from Theorem 
IIII.9f c) and the compactness of G. 

(b) Observe that transitivity of a implies surjectivity of 
Da x (G) and Dq x ([G1). Thus, Theorem [mj^b) yields 
the desired result, cf. [l02j |. □ 

This gives rise to the following definition. A manifold M 
is called a homogeneous G- space or for short a homoge- 
neous space, if there exists a transitive smooth Lie group 
action of G on M. In particular, any coset space G/H 
can be regarded as a homogeneous space via the canoni- 
cal action ([(?'], G) i-> [G'G] for [G 1 ] G G/H and G G G. 
Further results on homogeneous spaces, orbit spa ces and 
principal G-bundlcs can be found in [90, [98|, Il02| 



Remark III. 4. Note that by Theorem [11191 the orbit 
0(X) carries always a manifold structure the topology 
of which is equal or finer than the topology induced by 
M. 



Reductive Homogeneous Spaces 

Let M be homogeneous space with transitive Lie group 
action a:MxG^M and let H := Hx be the stabiliser 
subgroup of a fixed element X G M . Next, we are inter- 
ested in carrying over the Ricmannian structure of G to 
M or, equivalently, to G/H. First, we need some further 
terminology As most of the following terms arc con- 
veniently expressed via algebraic properties of the pair 
(G,H), we focus on the case M = G/H. Nevertheless, 
one could restate all result in terms of an abstract group 
action a on M. 

A homogeneous space G/H is reductive, if the Lie al- 
gebra f) of H has a complementary subspace p in g such 
that p is Adn-invariant, i.e. HpH^ 1 C p for all H G H. 



A Ricmannian metric (-|-) on G/H is called G- invariant 
if the mappings vq are isometries, i.e. if the identity 

mia>] = (B? G ([G'])C\B? G ([G']) V ) [G , G] (76) 

is satisfied for all £, r\ G T [G ,]G/H and G, G' G G. More- 
over, a bilinear form (-|-) on p is called 

(a) Adn- invariant if the identity 

{p\p') = (Adff p\ Adtf p') (77) 
is satisfied for all p,p' G p and H G H. 

(b) adf,- invariant if the identity 

(aA h p\p') = -(p\ad h p') (78) 

is satisfied for all p,p' G p and h G f). 

Note that G/H is reductive, if G has a bi- invariant met- 
ric, as one can choose p := f)^. Next, we give a general- 
isation of Proposition IIII . 21 and Theorem IIII. 41 to homo- 
geneous spaces. 

Proposition HI. 4. Let G/H be a homogeneous space 
with reductive decomposition g = f) © p. The following 
statements are equivalent: 

(a) There exists a G-invariant metric (-|-) on G/H. 

(b) There exists an Adn-invariant scalar product (-|-) 
on p. 

In addition, ifH is connected then (a) and (b) are equiv- 
alent to 

(c) There exists a adf, -invariant scalar product (-|-) on 



Proof. Cf. [al] and Proposition UTOl 



□ 



Theorem III. 10. Let G/H be a homogeneous space 
with reductive decomposition g = t) p. Then G/H ad- 
mits a G-invariant metric if and only if the closure of 
Ad H |p := {Ad H : p — > p \ H G H} is compact in GL(p). 



Proof. Cf. \84. 



□ 



Remark III. 5. (a) As a special case, Theorem IIII. 101 
implies the existence of bi-invariant metrics on 
compact Lie groups, cf. Theorem IIII.4I and [84l ]. 

(b) Replacing p by the quotient space g/t), allows to 
state Theorem IIII. 101 without referring to any re- 
ductive decomposition g = f) © p of g, cf. [84| . 
Moreover, it can be shown that any homogeneous 
space G/H w hich admits a G-invariant metric is 
reductive, cf. [103 1 . 
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Theorem IIII.10I can easily be rephrased for an arbi- 
trary homogeneous G-space M with transitive group ac- 
tion a : M x G — > M, by choosing H :— Hx with 
X G M. Note however, for orbits M := O(X) embed- 
ded in some larger Ricmannian manifold N, the invariant 
metric given by Theorem MI. 101 does in general not co- 
incide with the induced metric. This gives rise to the 
following definition. 

A manifold AI is called a Riemannian homogeneous G- 
space or for short Riemannian homogeneous space, if M is 
a homogeneous G-space with a-invariant metric, which 
is to say that the mappings a G ■ M — ► M, a G (X) := 
X ■ G are isometries of M for all G G G, i.e. 

(t\ri) x = (D a G (X)C\Ba G (X)r,) XG (79) 

for all £, t) G T X M and G G G. 

Proposition III. 5. ("aj ^4ny homogeneous space of 
the form G/H a G-invariant metric is a Rie- 
mannian homogeneous space. 

(b) Any Riemannian homogeneous space is isometric 
to a homogeneous space of the form G/H with a 
G-invariant metric. 

Proof. This follows straightforwardly from the pre- 
vious definitions and Corollary 1111.3T b) . □ 

Naturally Reductive Homogeneous Spaces and Geodesies 

Characterising the Riemannian connection of a homo- 
geneous space and its geodesies are in general advanced 
issues which we do not wa nt to address here, cf. [94| and 
the references therein e.g. 104j . However, there are two 
cases - see (a) and (b) below - which are easy to handle. 
A homogeneous space G/H is called 

(a) naturally reductive if it is reductive with comple- 
mentary space p and Adn-invariant scalar product 
(•|-) on p such that the identity 



(Pa.d g h\k) = -{h\Pad g k) 



(80) 



is satisfied for all g,h,k G p, where P denotes the 
projection onto p along \). 

(b) Cartan-like if it is reductive with complementary 
space p and Adn-invariant scalar product (-|-) on 
p such that the commutator relations 

[M]cf), [tl,p]cp and [p,p]cf). (81) 

are satisfied. 

Remark III. 6. If, in definition (a), the complementary 
space p can be chosen as the orthogonal complement of t) 
with respect to some AdG-invariant scalar product (-|-) 
on g, then condition (|80|) reduces to 



(ad ff h\k) = —(h\ &d g k) 



(82) 



Lemma III. 2. Any Cartan-like Riemannian homoge- 
neous space G/H is naturally reductive. 

Proof. By the commutator relation [p,p] C t), we 
have P&d g h = for all g, h G p. Thus Eqn. (JSC]) is 
obviously satisfied. □ 

Theorem III. 11 (Cosct Version). Let G/H be naturally 
reductive. Then G/H is Riemannian homogeneous space 
such that all geodesies through [G] € G/H are of the form 



ti-> [Gexp(i Ad G -ip)] = [exp(tp)G] 



(83) 



with p G p . 



Proof. Obviously, G /H is Ricmannian homogeneous 
space by Proposition IIII.41 For a proof for Eqn. (|53")l we 
refer to [13, □ 

The above result can be restated for an arbitrary nat- 
urally reductive Riemannian homogeneous G-space. 

Theorem III. 12 (Orbit Version). Let M be a homoge- 
neous G-space with transitive group action a : M x G — > 
M. Assume that G/Hy is naturally reductive with de- 
composition Q = hx © Px ■ Then M is a Riemannian 
homogeneous G-space such that all geodesies through 
Y = X ■ G G M are of the form 



tt-^Y- exp(i Ad G -i p) 



(84) 



with p G px- 



Proof. The result is a straightforward consequence 
of Theorem |nnU □ 



Thus, on naturally reductive spaces, the Riemannian 
exponential map is particularly si mple to compute. By 
taking the basic framework of Ref. [lOll ] further to discuss 
geodesies, Figure |4] illustrates that only in naturally re- 
ductive homogeneous spaces the geodesies project from 
the group G to give geodesies on G/H. Therefore, in 
this sense, in naturally reductive homogeneous spaces 
projection and exponentiation of tangent vectors com- 
mute. However, on reductive homogeneous spaces that 
are not naturally reductive, the problem is considerably 
more involved. A necessary and sufficient condition for 
t i — » [G'exp (tq)1 being a geodesic in G/H can be found 
in [94.[l05|. 

On the other hand, for numerical purposes it is often 
enough and even advisible to approximate the Rieman- 
nian exponential map by another computationally more 
efficient local parametrisation. Here, the map 



p 3 p ^ Ho l G o cxp(Ad G -i p) 



(85) 



for all g, h, k G p. 



might be a natural candidate, even if it fails to give the 
exact Riemannian exponential map. These issues are 
subject to c urren t research, and recent details can be 
found in 0, Il06| | . Figure 2] also shows how in reductive 
homogeneous spaces that are no longer naturally reduc- 
tive, the projected geodesic still provides a first-order ap- 
proximation to the geodesic generated by the projection 
of the tangent vector. 
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exp(p) 



TT: G^G/H 



G/H 




Figure 4: (Colour online) Geodesies in reductive homogeneous 
spaces G/H. The tangent vector p € p projects to the tangent 
vector £ at the coset [1] = H. Note that only in naturally re- 
ductive homogeneous spaces the geodesic in G generated by p 
projects onto G/H such that it coincides with the geodesic of 
the projected tangent vector in the sense Il(e tp ) = exp[jj(t£). 
In reductive homogeneous spaces that are not naturally reduc- 
tive, the projection yields only a first-order approximation at 
[1] — H as shown in the lower part, where II(e ip ) / expr-,,(t£). 



Adjoint Orbits 

A prime example for naturally reductive homogeneous 
spaces is provided by the adjoint action of a compact Lie 
group — a scenario which is of major interest in the forth- 
coming applications. Therefore, we summarise the previ- 
ous results for the particular case of adjoint orbits. Note 
that the adjoint action given by (X, G) i— ► Ado X := 
GXG -1 is a left action. However, all previous state- 
ments and formulas remain valid mutatis mutandis, e.g., 
right cosets have to be replaced by left cosets, etc. 

Corollary III. 4. Let G be a Lie group with Lie algebra 
q and let K C G be a compact subgroup with Lie algebra t 
and bi-invariant metric (■{■). Moreover, let a : gxK — ► g, 
(X,K) i — ► A&k X := KXK" 1 be the adjoint action o/K 
on Q and denote by ax ■ K — > g the map K t— > A<1k X . 
Then the following assertions hold 



(a) The stabiliser group H := Hj of X is a closed 
subgroup o/K. 

(b) The coset space K/H is diffeomorphic to the ad- 
joint orbit O(X) := {Ad K X \ K e K} of X. In 
particular, the map ax '■ K/H — ► O(X), [K] i— > 
Adif X is a well-defined diffeomorphism satisfying 
the commutative diagram 



K 



O(X)C0 



(86) 




K/H 



(c) Let f) := \)x denote the Lie algebra of H and 
p be any complementary space to t) in t, then 
Da(l) = — adx is a surjective homomorphism with 
ker adjc = f) and commutative diagram 



Da x (l) 




TxO(X) c g 



(87) 



5x([l]) 



Moreover, the tangent space of O(X) at Y = 
Adx X is given by 

T y O(X) = ady- 1 = ad y (Ad K -i p). 

(d) O(X) = K/H is naturally reductive. More pre- 
cisely, p := fj- 1 - yields a naturally reductive decom- 
position oft with Adn- invariant scalar product on 
p is given by the restriction of (-|-). 

(e) There is a well-defined a-invariant metric on O(X) 
given by 



Ad k X 



with £ = ady ( Adx , f\ — ady ( Adx Ptj) and 
P(,,Pv e P- 

(f) All geodesies through Y = Ad K X E 0(X) with 
respect to the metric given in part (e) are of the 
form 



Ad oxp (( Adjf p) Y 



(89) 



with p £ p. 



Proof. Part (a) and (b) follow immediately from 
Theorem [OL9] and Corollary |TTL3] 



(c) For k E t we have 
d 



dt 



Ad 



cxp(ifc) 



x 



t=o 



adx k 



and thus Da(l) = — ad^- All other statemants are again 
consequences of Theorem IIII.91 
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(d) First, observe that the bi-invariance of (-|-) implies 
that 6 = \) © p with p := (j- 1 is reductive. Now, let P 
denote the orthogonal projection onto p. In turn, the 
bi-invariance of (-|-) yields 

{Pad g h\k) = (ad g h\k) = -{h\ad g k) = -(h\Pad g k) 

for all g,h,k G p, cf. Proposition IIII.21 Therefore, 
O(X) = K/H is naturally reductive. 

(e) Let Y G O(X) and K G K. A straightforward 
calculation using the identities Da^(y)^ = Ad^£ for 
£ G TyO(X) and Ad^ (ady k) = ad A d^r(Ad^ k) for all 
fc £ I yields the required invariance. 

Part (f) follows immediately from Theorem IIII. 121 and 
the identity l)y = Ad^ t)x for Y = Adx X which implies 
i)^ = Ad K p. □ 



Gradient Flows on Riemannian Homogeneous Spaces 

Applying the previous results on gradient flows to 
quality functions / on Riemannian homogeneous spaces 
G/H, wc obtain by the G-invariance of the Riemannian 
metric — similar to (|52p — the gradient equality 

grad f([G}) = Df G ([l]) grad(/ o r G ) ([!}) (90) 

for all G G G, where r G denotes the mapping [G'} i— > 
[G'G]. Therefore, the gradient of / is completely deter- 
mined by 

G^grad(/of G )([l])Gp. (91) 

However, Eqn. (|91|) does not induce a mapping from 
G/H to p, as in general 

grad(/o? G )([l]) ^ grad(/°r ffG )([l]) 

for H G H \ {!}. The corresponding gradient system 
reads 

[G] =Df G ([l])grad(/of G )([l]). (92) 

For analysing the asymptotic behaviour of Eqn. (|92p . 
Subsection IIII Al again provides the appropriate tools. 
For instance, if G/H is compact we have. 

Corollary III. 5. Let G/H be a compact Riemannian 
homogeneous space and let f : G/H — > R be real analytic. 
Then any solution of Eqn. converges to a critical 

point of f for t — > +oo . 

Proof. This follows immediately from Proposition 
IIII. II and Theorem IIII. II as a Riemannian homogeneous 
space constitutes always a real analytic Riemannian man- 
ifold, cf. mm. D 



Finally, we return to our starting point and ask for 
the relation between (|53[) and (f9"2"|) in the case of an H- 
cquivariant quality function /. Here, / induces a quality 
function f on G/H via 

f([G}) := f{G) (93) 

for all G G G. Moreover, assume G carries a bi-invariant 
metric (-|-) and G/H is a homogeneous space with reduc- 
tive decomposition g = () © p and p := t)^. This implies 
that the restriction of (• | •) to p x p is Adn-invariant. Now, 
the identity f oU = f yields 

D/([G])-Dn(G) = D/(G) forallGeG. (94) 

and hence 

(D 11(G)) * grad/([G]) = grad/(G) (95) 

for all G G G, where II denotes the canonical projection 
and (•)* the adjoint mapping. By identifying p with the 
tangent space of G/H at [1], the map DII(l) represents 
the orthogonal projector h + p i— > p for h G I) and pep. 
Hence, we obtain 

Dn(l)(DH(l))* =id p . (96) 

In the same way, using the identity n o r G = r G o n, one 
shows 

Dn(G)(Dn(G))* =id T[G]G / H (97) 

for all G G G. Thus $E§ yields 

grad/([G]) = Dn(G) grad /(G) (98) 

for all G G G. Therefore, we have proven the following 
result: 

Theorem III. 13. Suppose G/H satisfies the above as- 
sumptions and f : G — > K is a H-equivariant qual- 
ity function with induced quality function f : G/H — > 
R. Then the canonical projection of the gradient flow 
of Eqn. i53\) onto G/H yields the gradient flow of 
Eqn. A92\) , i.e., if G(t) is a solution of Eqn. \53\) then 
n(G(t)) is one of Eqn. (U§ . 

D. Examples 

Often practically relevant quality functions take the 
form of a linear functional restricted to an adjoint orbit 
O(X). For instance, in quantum dynamics the unitary 
orbit 0(A) := {UAW | U G SU(N)} of an initial state A 
plays a central role, because it defines the largest reacha- 
bility set under closed Hamiltonian dynamics. Then the 
set of feasible expectation values is such a linear map, 
since it is the projection onto an observable G in the 
sense of a Hilbert-Schmidt scalar product. These ex- 
pectation values can be generalised to arbitrary complex 
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square matrices A, C G c^xjv as ^ coincide with 
the C '-numerical range 



W(C, A) := {tT(tfUAtf) | U G SU(N)} 



(99) 



As C-numerical ra nges are w ell established in the math- 
ematical literature [1071 . 1 1 081 ] , in the sequel we will adopt 
the notation. 

Note that finding the maximum absolute value, i.e., 
the C-numerical radius 

r(C,A):= max \tr{CftUAUH\ (100) 

ueSU(N) 

is straightforward for Hcrmitian A, C (it amounts to sort- 
ing the respective eigenvalues, cf. Corollary IIII.8[) , while 
for arbitrary complex A, C there is no general analyti- 
cal solution. Moreover, when restricting to local unitary 
operations K G SU\ oc (2 n ) := SU(2)® n , the maximisa- 
tion task becomes non-trivial even for Hcrmitian A, C 

fio9iirrot 

Therefore, we now illustrate the previous theory by 
gradient flows on the entire unitary group SU(2 n ), on 
the local unitary group SU (2)®" as well as their adjoint 
orbits. 



Worked Example: SU{N) 

Recall that SU(N) is a compact connected Lie group 
of real dimension ./V 2 — 1. Its Lie algebra, i.e. its tangent 
space at the identity is given by set su(N) of all skew- 
Hermitian matrices £1 with trQ = 0, i.e., 



su(N) := {fl G 



••NxN 



fit 



-fi, trf2 = 0}. (101) 



So elements !7 G su(N) relate to Hamiltonians H via 
fi = iH. The tangent space at an arbitrary element 
U G SU(N) is 

TuSU(N) = su(N)U = {nu | n G su(N)}, (102) 

cf. Eqn. (fl5|) . Moreover, let SU(N) be endowed with the 
bi-invariant Ricmannian metric 



(riU\EU)u :=tr(ntS), 



(103) 



defined on the tangent spaces TuSU(N), cf. Eqn (JUJ). 

Now set 

g : SU(N) -> C NxN , g{U) := C^UAU^ 
/ : SU{N) K, f{U) := Retr{C^UAU^} 

For computing the tangent map of g, we exploit the 
fact that SU(N) is an embedded submanifold of C NxN . 
Therefore, the tangent map is obtained by restricting the 
ordinary Frechet derivative Dg(U) to the tangent space 
TjjSU(N), cf. Appendix A. Thus, by applying the prod- 
uct rule, one easily finds 



Bg(U)(nU) = C^ilUAU^ + c ] uA(wy 

= c^nuAu^ -c^uAu^n . 



(104) 



(105) 



Now, the chain rule as well as the short-hand notations 
A := UAU> and [■, -]s to denote the skew-Hcrmitian part 
of the commutator [•, ■] give 

Df(U)(QU) = D(Rc tr)(ff(C0) ° T>(g(U))(ilU) 
= Rctr{C t fL4-C t ifi} 
= Retr{[A ) C t ]fi} 
= tr{[A,&] s a} 
= ([A,C}^) 
= ([A,C%U\SW), 

where the last identity explicitly invokes the right- 
invariance of the Ricmannian metric on SU(N), cf. 
Eqn. (|103[) . Now, identifying the above expression with 

Df(U)(ftU) = (grad/(C/)|Of/) (106) 

one gets the gradient vector field 

grad/([/) = [i,Ct]t,[/ (107) 

and thus the gradient system 

U = &*df(U) = -[A,Cft]sU . (108) 

By making use of the Riemannian exponential, see 
Eqns. © and ([TT]). we finally arrive at the discretisation 

U k+1 = e -^iu k Auic] s Uki (109) 
where a k > denotes an appropriate step size. 

Gradient Flows on SU(N) 
Consider a fully controllable system on SU (N) such as 

n 

(£) U(t) = -i(H d + u k (t)H k , a )u{t) (110) 



k = l 



with H d := Yuk<i J uPk,z°Lz, H k ^ a := cr fc;Q , a G {x, y) 
and connected spin-spin coupling topology, cf . [27| . Here 
and in the sequel, 



(111) 



denote the Pauli matrices, which form an orthogonal ba- 
sis of su(2). Moreover, a k . a , a G {x, y, z} is defined by 





'0 i~ 




'0 -1" 




"i 0" 




i 


, <7y : = 


1 




-i 



h ® 



h 



(112) 



where the term a a appears in the fc-th position of the 
Kronecker product and I2 denotes the 2 x 2-identity ma- 
trix. 

As (S) is fully controllable by assumption, the entire 
group SU(N) can be generated by evolutions under the 
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Hamiltonian of the system plus the available controls. 
If A is an initial density operator or a matrix collecting 
its signal-relevant terms, the orbit of the canonical semi- 
group action of (E) on A yields in this case the entire 
unitary orbit O(A) = UAU'. Recall its 'projection' on 
some observable C (or its signal-relevant terms) forms 
the C '-numerical range of A (see Eqn llOO]) . 

In this setting, there are two geometric optimisation 
tasks of particular practical relevance as they determine 
maximal signal intensity in coherent spectroscopy [26j . 

(a) Find all points on the unitary orbit of A that min- 
imise the Euclidean distance to C . 

(b) Find all points on the unitary orbit of A that min- 
imise the angle to the 1- dimensional, complex sub- 
space spanned by C. 



Clearly, the distance 



\UAU* 



c\\i 



\A\\l 



\C\\\ 



2Re{tr (tfUAtf)} 



(113) 



is minimal if the overlap Re{tr (C^UAU*)} is maximal. 
Moreover, making use of the definition of the angle be- 
tween 1-dimensional complex subspaces 



cos 



W UAU\C}) := \^]^}\\ (114) 



Ml 



problem (b) is equivalent to maximising the function 
\ti(C^U AU^)\. Its maximal value is the C-numerical ra- 
dius of A (sec Eqn.HU). Obviously, r c (A) < \\A\\ 2 ■ \\C\\ 2 
with equality if and only if UAW and C are complex 
collinear for some U £ SU(N). Note that the two tasks 
(a) and (b) are equivalent whenever the C-numerical 
range forms a circular disk in the complex plane (cen- 
tred at the origin); co nditio ns for circular symmetry have 
been characterised in [lll| . 

Extending concepts of Brocket t flBl from t he orthogo- 
nal to the special unitary group [26l. |27|. \l!2\ , the above 
optimisation problems (a) and (b) can be treated by 
the previously presented gradient-flow methods, cf. also 

mil. 

For fixed matrices A, C £ £, NxN define 



fx : SU(N)^R, fi{U):= Rctr(C ,t [/AC/ t XH5) 



and 

h : SU(N)^R, f 2 (U):=\tr(&UAtf)\ 2 .(lW) 

Observe that the distance problem (a) is solved by 
maximising f\ , while the angle problem is solved for max- 
imal f 2 . 

Now, the differential and the gradient of f\ with re- 
spect to the bi-invariant Ricmannian metric Eqn. (|103|) 
is precisely given by the previous example as 



Dh(U){W) 
grad/i(£/) 



Retr([[/^C/ t ,C t ]fi), 
[UAU\C% U. 



The differential and the gradient of f 2 can be obtained 
in the same manner as 

Df 2 {U){W) = tx{C^UAU^f •tr([[M[/ t ,C t ]ft) 

- tr(C + C7Af7 t ) • tr([UAU\ C^tfi), 

grad/ 2 ([/) = 2(f 2 (U)* -\UAU\C%U. 
This yields the following result. 

Theorem III. 14. The gradient systems of f v , v = 1,2 
with respect to the bi-invariant Riemannian metric \103\) 
are given by 



U = VL V {U)U 



(117) 



with 



Hi (17) := [UAU\C% and Q 2 {U) := 2(f 2 {U)* -[U AU\C^])\ 

(118) 

respectively. Each solution of converges to a re- 

spective critical point for t — > +oo. Thereby, the critical 
points of f v are characterised by f2„(L/) = 0, v = 1,2. 

Proof: The above computations immediately yield 
Eqn. (|117p . As /„, v = 1,2 are real analytic, the conver- 
gence of each solution to a critical point i s gua ranteed by 
Proposition [JO and Theorem ImTll cf. [nl|. □ 

An implcmentablc numerical integration scheme for 
the above gradient systems making use of the Rieman- 
nian exponential, see Eqns. ([9]) and (jlip . is given by 



Ui% = exp (ar^« } )) l%\ U = 1 N (119) 

A suitable choice of step size s otj > ensuring conver- 
gence can be found in [2(| [U Ill2| . Generically, it drives 

Ilk into final states attaining the maxima of the quality 
functions /„, v = 1,2. However, there is no guarantee 
that the gradient flows always reach the global maxima. 
Standard numerical integration procedures such as e.g. 
the Euler method are not applicable here as they would 
not preserve unitarity. 



Gradient Flows on the Local Subgroup 5 , L r ioc(2™) 

The quality functions introduced in the previous sub- 
section may be restricted to the subgroup of local action, 
i.e. to 



SU loc (2 n ) := SU(2) , 



SU(2) c SU(2 n ). (120) 



n-timcs 

The Lie subalgebra to SUi oc (2 n ) C su(2") can be speci- 
fied by 



su loc (2") := 



1 2 g> Slj <g> 1 2 < 



Qj G su(2) 
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where the term flj £ su(2) appears at the j-th position, 
cf. Eqn. (|112j) . Therefore, the tangent space of SU\ oc (2 n ) 
at an arbitrary element U is given by 



TuSU loc (2 n ) = {W | fi £ su loc (2™)}. 



(121) 



Finally, SU\ oc (2 n ) is endowed with the bi-invariant Ric- 
mannian metric induced by SU(2 n ), i.e. 



(nU,ZU)u :=tr(fitS) 
for QU,EU £ TuSU loc (2 n ). 



(122) 



Lemma III. 3. Let H C GL(N, C) be any closed sub- 
group with Lie algebra f) C gl(N,C) := C NxN . Moreover 
let hi, ... , h m be a real orthonormal basis of f) with re- 
spect to the real scalar product 

( fll | fl2 ):=Retr(4 52 ), g u g x £ C NxN , (123) 
i.e. span R {/ii, . . . , h m } = \) and (hi\hj) = Sij. 
(a) Then the orthogonal projection P : Q — * 



Pg :=^2Reti{h]g}hj 



(124) 



(b) The orthogonal projection P 1 - : C NxN -> C NxN 
onto the orthogonal complement t) 1 - is given by 



g i ^ P^g = g - Pg. 



(125) 



Proof: Both, (a) and (b) are basic and well-known 
facts from linear algebra. □ 

Remark III. 7. For the unitary case, i.e. for f) C su(N), 
the real part in Eqn. (|124|) can be neglected and the pro- 
jector P can be rewritten in the more convenient matrix 
form P as 



P := ^vec(/i j )vcc(/i j ) t , 



(126) 



where the terms vec(hj) vec(hjY represent the rank-1 
projectors Pj = \hj)(hj\ in vec-notation. 

Corollary III. 6. The orthogonal projection P : 
C NxN ->C NxN onto sui oc (2") with respect to (223P is 
given by 

1 n 

P 9 : = ^ E ( Rc (totfX k j)X k + Re (tr( 5 tF fe ))F fc 



fc=i 



Rc ( tr 



(fftz fc ))Z fc ), 



(127) 



where X k , Yfc and Z k are defined by, cf. Eqn. Q112[) 

Xk := (Jfe.a, Y := ak, y , Z := a k ^ z . (128) 



Proof: This follows straightforwardly from the or- 
thogonality of the set {X k ,Y k , Zk \ k = l,...,n} and 
Lemma HES □ 



Theorem III. 15. Let f\ oc be the restriction of ill5)) to 
SU loc (2 n ). 

(a) The gradient of f\ oc with respect H22\) and the cor- 
responding gradient system are given by 

grad/ loc (C/) = P{[C\ UAU^])U (129) 

and 

U = P{[C\UAU^])U, (130) 

respectively, where P denotes the orthogonal pro- 
jection P : 0t(2 n ,C) -> fl[(2",C) onto sui oc (2"). 
More explicitly, tl30\) is equivalent to a system of 
n coupled equations 



(131) 



U k = QkUk, k = l,--- ,n 
on SU(2), where 

n k = i r (Rc(tr([C t ,C/AC/t]tx fe ))X 

+ Re(tr([C t ,C/AC/ t ] t y fe ))r 

+ Rc(tr([C t ,C7 AU^ Z k ))Z 

Each solution of converges for t — > ±oo to 

a critical point of f\ oc . Thereby, the critical points 
are characterised by 



P{[C\UAU^]) = 0. 



(132) 



(b) The Hessian form Hess/i oc (J7) and the Hessian op- 
erator Hess/i oc (C7) of /ioc at U are given by 

Ressf loc (U)(QU,EU) = 

= i(Re(tr(Qt[ct,[H,C/AC/t]])) (133) 

+ Rc(tr(ft t [lM[/ t ,[S, 



(134) 



and 

Hessf loc (U)W = (S(U)Q)U, 
respectively, with ft £ sui oc (2™) and 
S(U)n := (135) 
\p([C\[n,UAtf]} + [UAU\[Vl,tf]]). 

(c) For all initial points Uo £ SU\ oc (2 n ) the discretiza- 
tion scheme 

U k+1 := cxp (a k P([C\ U k AUl\)) U k (136) 
with step size a k := 

\\p(iC<< ,U k Aul])\\ 2 



\\[Ci,P([Ct,U k AUl])}\\ ■ \\[[P([C1,U k AUl]),U k AUl] 
converges to the set of critical points of f\ oc . 



,(137) 
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Proof: The s ubsequent arguments follow our con- 
ference report jll3l | , which also contains a complete proof 
for the flow on the entire groups such as SU{2 n ). 

(a) Since SUi oc (2 n ) is a closed subgroup of SU(2 n ), it 
is also an embedded Lie subgroup and thus a sub- 
manifold of SU{2 n ), cf. Remark |IlL2l Therefore, 
the gradient of /i oc is well-defined by (|36p . Further- 
more, by ([55)) and (|1 1 8[) we obtain 

grad/i oc (t/) = 
= P(grad/i(E0) = P{[UAU\C^)U 
= P({C\UAtf])U, 

where the last equality follows from 
P{[UAW,C^) = -P(jUAUi,&]) and the 
skew-symmetry of the commutator. Moreover, 
Eqn. (|131[) is derived by Corollary IIII.6I and the 
identity 



_d 

di 



(t/i(t)® •••»£/'»(*)) = 



. fc=i 



*{Ui{t) ®---<g> {/„(*)). 



Now, compactness of SU\ oc (2 n ) and real analyticity 
of /ioc imply that each solution converges to crit- 
ical points for t — > +oo, cf. Proposition IIII. ll and 
Theorem HTLH 

(b) By ([U]); the Hessian of f\ oc at U is determined by 
evaluating the second derivative of tp := / o 7 at 
i = 0, where 7 is any geodesies. This yields 



Hess/i oc (t7)(QE/,f2t7) := p"(0) 
= Re(tr(C t [fi, [il,UArf]])), 



(138) 



for fi G suioc(2"). The Hessian then is obtained 
from quadratic form (| 1 38[> by a standard polarisa- 
tion argument, i.e. 



Ressf loc (U)(nU,EU) = 
= i(Rc(tr(C7t[a[S,C/AC/t]])) 

+ Re(tr(C t [S,[r!,[M[/ t ]])) 



Finally, by the identity tr[X, Y]Z = -tvY[X,Z] 
we conclude 

Hess/ loc ([/)(f![/,S£/) = 
= i(He(tr(fit[ C ft J [S,^t]])) 

+ Re(tr(Ot[[/AC/ t ,[S,C t ]])) 



Therefore, the Hessian operator of /i oc at U is given 

by 

Hessf loc (U)W = (S(U)ti)U (139) 
with f2 G sUioc(2") and 
S(U)Cl := (140) 
±p([&,[n,UAUi]] + \UAtf,[il,Cft]]). 

(c) Estimating the second derivative 

ip"(t) =Rc(tr([C\n}[n,e m UAUh- m })) (141) 

for Q := grad/ loc (t/) = P([Ct, UAW]) and [/ G 
St/ loc (2") yields 



<S\t)\ < 


[crt.n] 




[fi.e^t/At/V"] 




[ct,n] 









Therefore, we get the estimate 
d 2 

— 2/i oc (exp c/ (Oi)) 



max 

t>o 



< 



c\n] ■ [n,uAtf] 



for n := grad/i oc ({7). Now, a standard Lyapunov- 
type argument, similar to the proof of Theorem 3.3 
in cf. [21j, yields the desired result. 

□ 

For similar discretisation schemes in different contexts or 
othe r intrinsic Riemannian methods see also [H, H3, HE 

El. 



Double-Bracket Flows on Naturally Reductive Homogeneous 
Spaces 

The well-known double-bracket flows have established 
themselves as useful tools for diagonalising matrices (usu- 
ally real symmetric ones) as well as for sorting lists 
[Ia,IT^.l2lLl22T|. Mo reover, th ey relate to Hamiltonian in- 
tegrateble systems |l!5l . Ill6| . (Note again that in many- 
particle physics gradient flows were later introduced in- 
dependently for diagonalising Hamiltonians [HE [EH-) In 
summarising the most important results we show that 
double-bracket flows can be viewed as special cases of 
gradient flows on naturally reductive homogeneous spaces 
G/H in terms of Sec. IIII Cl where H is a stabiliser group, 
which is typically not normal. Then the homogeneous 
space G/H does not constitute a group itself. 

Let 



O(A) := {UArf\U G SU(N)} 



(142) 
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denote the unitary orbit of some A 6 c WxA \ Note that 
the adjoint action (U,A) i-> Ad^ A := C/AC/t of SU(N) 
constitutes a Ze/t action on the Lie algebra g := <C NxN . 
However, this should not cause any confusion for the 
reader since the key result we refer to — Corollary IIII.4I 
was presented for left actions. 

Let C £ <C NxN be another complex matrix. For min- 
imising the (squared) Euclidean distance \\X — C||| be- 
tween C and the unitary orbit of A we derive a gradient 
flow maximising the target function 



f(X) :=Retr{Ctx} 



(143) 



over X £ O(A). Clearly, this is but an alternative to 
tackling the problem by a gradient flow on the unitary 
group, since as in Sec. IIII Q we have the equivalence 



max f(X) 

xeO(A) v ' 



max f(U) 

uesu(N) 



(144) 



for f(U) := Rctr{C^UAW} 

Building upon Corollary IIII.4[ we have the follow- 
ing facts: O(A) constitutes a compact and connected 
naturally reductive homogeneous space isomorphic to 
SU(N)/U. Here, 



H := {U £ SU(N) | Ad v A = A} 



(145) 



denotes the stabiliser group of A. Recalling that the Lie 
algebra of SU (N) is given by 



su(N) := {SI £ 



^NxN 



& = —SI}, 



(146) 



we further obtain for the tangent space of O(A) at X = 
Ad(j A the from 

T x O(A) = {ad x Sl\Sl£ su(N)} (147) 

Moreover, the kernel of ad A : 



with adx SI := [X, SI 
Su(N) — > g reads 



f) = {f!s su(N) | [A, SI] = 0}. 



(148) 



and forms the Lie subalgebra to H. Now, by the standard 
Hilbert-Schmidt scalar product (Sli, Qg) l—> trjojf^} on 
su(N) one can define the ortho-complement to the above 
kernel as 



(149) 



This induces a unique decomposition of any skew- 
Hermitian matrix 51 = 0''+ Sl p with € F) and 
O p £ p. Finally, we obtain an Ads[/(./v)-invariant Rie- 
mannian metric on O(A) via 

(adx(Ad u Sl 1 )\ad x (Ad u Sl 2 ))x := tr{of^} (150) 



for X := Adu A, which is equivalent to saying 



(aA x (Q 1 )\aA x (Q 2 ))x ■= tr{fi? xt f2^} 



(151) 



with px ■= Adu p. Now, the main results on double- 
bracket flows read as follows: 



Theorem 111.16. Set f : O(A) -> K, f(X) := 
Retr{C^A}. Then one finds 

(a) The gradient of f with respect to the Riemannian 
metric defined by Eqn. (| 150[) is given by 



grad/(X) = [X, [X,&] s ], 



(152) 



where [X, C']g denotes the skew-Hermitian part of 
[*,Ct]. 



(b ) The gradient flow 

X = grad /(A) = [X, [X,tf] s ] 



(153) 



defines an isospectral flow on O(A) C Q. The solu- 
tions exist for all t > and converge to a critical 
point Xoa of f(X) characterised by LY^C^s = 0. 

Proof. (A detailed proof for the real case can be 
found in [2l[; for an abstract Lie algebraic version sec 
also [3.) 

(a) For X = Ad v A and £ = ad* SI £ T x O(A) we 
obtain 



Df(X)ad x Sl 



^Retr{CV n Ae m } 
Retr-tC 1 adx^}. 



t=o 



Therefore, the gradient of / has to satisfy 

RetrjC 4 &d x Sl* x } = (grad f(X)\ &d x Sl t ' x )x 

for all Sl px £ p x - Applying Eqn. (fT50|) to X = A gives 

RctrjC* 1 ad A Sl p } = tr{r pt ft p } 

for all Sl p £ p, where T p is defined by grad f(A) = ad A T p 
with T p £ p. Thus, we finally arrive at 

tr{(ad At C)lSl*} = tr{r pt ft p } 

for all Sl p £ p, where (ad A t C)g denotes the skew- 
Hermitian part of ad A t C. Hence, r p = (ad A t C) p s . 
Moreover, for SI* 1 £ f), we have 



tr{(ad At C)^"} 



-tr{ad A tfStf} 
tr{& ad A Sl h } = 0. 



Hence, (ad A t C)s £ P and therefore 

grad /(A) = ad A (ad At C) s = [A, [A,tf] s ]. 
The same arguments apply to X = Adu A and thus 
grad /(A) = [X, [X,C*] S ]. 



(b) Since Eqn. Q152p evolves on the unitary orbit of A, 
the associated flow is isospectral by construction. The 
compactness of O(A) then implies that each solution 
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X(t) of Eqn. (|152[) exists for all t > and converges 
to the set of critical points cf. Proposition IIII.ll More- 
over, from Theorem MI. 1 1 we derive that X(t) converges 
actually to a single critical point X^ of /, i.e. to a point 
Xoo which satisfies 

[X oo ,[X oo ,tf} s }=0. (154) 

Since [XoojO^jg € Px x , Eqn. (|154[) is equivalent to 

[^00,^5 = 0. 

□ 

In order to obtain a numerical algorithm for maximis- 
ing / one can discretise the continuous-time gradient flow 
(|152|) as in the previous examples via 

X k+1 = e - a ^ Xk < c ^ s X k e °*[*».c*] s ( 155 ) 

with appropriate step sizes a k > 0. Note that Eqn. Q155P 
heavily exploits the fact that the adjoint orbit O(A) con- 
stitutes a naturally reductive homogeneous space and thus 
the knowledge on its geodesies, cf. Corollary IIII. 41 

Remark III. 8. As an alternative to Eqn. 11551 taking 
the standard Eulcr-typc iteration 

X k+1 =X k + a k [X k , [X k , C%] (156) 

does not retain the isospectral nature of the flow. There- 
fore, it should only be used as a computationally inexpen- 
sive, rough scheme in the neighbourhood of equilibrium 
points, if at all. 

For A, C complex Hcrmitian (real symmetric) and the 
full unitary (or orthogonal) group or its respective orbit 
the gradient flow (|152[) is well understood, cf. Corollary 
IIII. 81 However, for non-Hermitian A and C, the nature 
of the flow and in particular the critical points have not 
been analysed in depth, because the Hessian at critical 
points is difficult to come by. Even for A, C Hcrmitian, a 
full critical point analysis becomes non-trivial as soon as 
the flow is restricted to a closed and connected subgroup 
K C SU(N). Nevertheless, the techniques from Theo- 
rem IIII. 161 can be taken over to establish a gradient flow 
and a respective gradient algorithm on the orbit Ok hi 
a straightforward manner. 

Corollary III. 7. The gradient flow (]152[) restricts to the 
subgroup orbit O k (A) := {KAK^ \ K G K c SU(N)} 
by taking the respective orthogonal projection Pi onto the 
subalgebra t C Su(N) o/K instead of projecting onto the 
skew-Hermitian part, i.e. X = [X, Pt[X, C']]. 

The corresponding discrete integration scheme takes the 
form 

X k+1 = e-^JM**,^] Xk e a k P t lx k ,^] (157) 
with appropriate step sizes a k > 0. 



In view of unifying the interpretation of unitary net- 
works, e.g., for the task of computing ground states of 
quantum mechanical Hamiltonians H = A, the double- 
bracket flows for complex Hermitian A, C on the full 
unitary orbit O u (A) as well as on the subgroup orbits 
Ok(A) for different partitionings K := {K G SU(N\) ® 
SU{N 2 )®---®SU(N r )\Y[ r j=1 Nj = 2"} have shifted into 
focus [36|. Therefore, we have given the foundations for 
the the recursive schemes of Eqns. (j!55|) and (|157p . which 
are listed in Table HJ as U1P and U1KP. 

Finally, we summarise what is known about the na- 
ture of critical points for the real symmetric or complex 
Hermitian case. For a detailed discussion of the real sym- 
metric case and the orthogonal group see e.g. [2lj . 

Corollary III. 8. Let C and A be real symmetric or com- 
plex Hermitian and assume for simplicity that they show 
distinct eigenvalues in either case. Then one finds: 

(a) For A, C real symmetric, define with respect to the 
special orthogonal group SO(N) and Y G (A) := 
{OAO T \0 G SO(N)} a pair of target functions on 
the group and on the respective orbit by 

g(0) := tr{C T OAO T } (158) 

g(Y) := t r {C T Y} . (159) 

Then the gradient flow 

6 := grad g(0) = [OAO T , C) O (160) 

shows 2^ N ~ lS> N\ critical points, while the double- 
bracket flow 

Y:=gradg(Y) = [Y,[Y,C}] (161) 

only shows N\ equilibrium points. 

(b) For A,C complex Hermitian, and X G O u (A) := 
{UAW\U g SU(N)} 

f(U) := tr{tfUAtf} (162) 

f(X) := tr{CtX} (163) 

the gradient flow on the special unitary group 
SU(N) 

U := grad f(U) = [UAU\ C]U (164) 

shows a continuum of critical points, while the 
double-bracket flow on the unitary orbit 

X:=gcadf{X) = [X,[X,C\] (165) 
again shows only N\ equilibrium points. 

(c) On the orbit, the respective target function has a 
unique global maximum which is given by the di- 
agonalisation diag (Ai, . . . , Xn), Ai > ■■• > Xn 
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of A, if C is assumed to be diagonal of the form 
C = diag (/xi, . . . , [In), Mi > ' ' ' > Mat- Moreover, 
the respective gradient flow converges to the unique 
global maximum for almost all initial values with 
an exponential bound on the rate. 

Proof. 

(a) and (b) The counting arguments follow immedi- 
ately from the fact that in either case for G diago- 
nal with distinct eigenvalues, the set of critical points 
Coo := {!«, e 0{A)\[X oo ,C] = 0} on the orthogonal or 
unitary orbit is given by N\ different diagonalisations of 
A and remains therefore invariant under conjugation by 
any permutation matrix. 

Moreover, on the orthogonal group O(N), the sta- 
biliser group of A is given by 

{diag(±l,±l,...,±l)} , 

which adds 2 N independent further degrees of freedom. 
Finally, restricting to SO(N) we obtain 2 N ~ 1 N\ critical 
points on the group level. 

In contrast, for the unitary case SU(N), the stabiliser 
group of A reads 

N 

{diag(e i ^,...,e i ^,...,e i< >") | ^<j> v G 2^Z, ^el), 

v=X 

which is always continuous. 

(c) Since G is symmetric or Hcrmitian, we can assume 
without loss of generality that G is diagonal. Then, the 
critical point condition [X^ , G] yields that the critical 
points of g and, respectively, / are given by the diagonal- 
isations of A. Moreover, analysing the Hessian at critical 
points shows that there is only one global maximum in 
both cases and no local ones [21j . 

The exponential convergence of the gradient flows 
Eqns. (|16ip and (|165|) to the respective unique global 
maximum for almost all initial values is also established 
via the Hessian, i.e. by linearising the respective gradient 
flows at critical points [2l[. □ 

Gradient Flows on Naturally Reductive Coset Spaces 

More generally, let G be a Lie group with bi-invariant 
metric and let / be an equivariant quality function with 
respect to the closed subgroup H, i.e. for all H G H one 
has /(G) = f(HG), so 

/|hg = constant 

for every G G G. Moreover, assume that G/H is a 
naturally reductive coset space. Establishing a gradi- 
ent method for the induced quality function / on G/H 
finally yields a recursion scheme which looks like the cor- 
responding one on the group level, cf. Theorem IIII.11I 
and 1111.131 This, however, is not surprising, as the equiv- 
ariance of / guarantees that the gradient (taken on the 



group level) at G G G is orthogonal to the coset HG. 
Thus, its 'pullback' to the Lie algebra automatically be- 
longs to p inducing a gradient flow on G/H. — This can 
be illustrated as follows. 

With G/H being naturally reductive, there is the re- 
ductive decomposition g = f) © p with p :~ t)- 1 , so any 

G g decomposes uniquely into fi = ffi + iW. Then, by 
the cquivariancc of / one finds 

(grad/(G)|ft"G) = D/(G)O"G = 

for all Sl^ G f). Therefore, the 'pullback' of the gradi- 
ent of / to q satisfies grad /(G)G -1 G p. Furthermore, 
combining Eqns. (|TTj) . ([§5]) and the identity 

D(noZ G )(l)ft = Dn(G) GQ 

for all O G Q (cf. Remark |III.3[) yields 

grad/([G]) =D(no/ G )(l)(G- 1 grad/(G)). (166) 

Thus, from Eqn. (|53"|) we finally obtain 

cx P[G] (tgrad /([G])) = [cxp (tgrad/(G) G" 1 ) G] 

for all i £ R, where expj G ] denotes the Riemannian expo- 
nential map at [G], cf. Eqns. (p~0|) and (fTTj) . — This pre- 
cisely explains what we meant by the above statement 
'looking like the one on the group level'. 

Example Let / : SU(2 n ) — > R be an arbitrary smooth 
function that is equivariant under local unitary opera- 
tions of the n,-fold tensor product SU\ oc (2 n ) := SU(2) Cg> 
■ ■ -(E>SU(2). This includes, e.g., any measure of entangle- 
ment /i e{U) that varies smoothly with U . Since by equiv- 
ariant construction grad/|5[/ loc ( 2 n) = 0, we may consider 

the flow to [U] = grad f{\U]) on the homogeneous space 

G/K = SU(2 n )/SU loc (2 n ) , 

which is naturally reductive for all n and Cartan-like only 
for n = 2. This can be seen, because (i) SU{2 n ) carries a 
bi-invariant metric induced by the Killing form allowing 
to define p := f- 1 , which gives the reductive decompo- 
sition q = i © p, yet only for n — 2 one recovers the 
commutator inclusions [6, t] C 6, [p,p] C and[{!,p] C p; 
(ii) in any case, by Prop. IITL41 there is an Adjc-invariant 
scalar product on p; and (iii) Eqn. (|HU|) is fulfilled for all 
{a,b,c} C p, as tr{[a,6]^c} = — tr{V[a,c]} , cf. Remark 
IIII. 61 - Clearly, this example generalises analogously 
to functions that are equivariant under actions of other 
partitionings of the full unitary group giving flows on the 
corresponding reductive homogeneous spaces 

G/K = SU(N)/(SU(N 1 ) ® SU(N 2 ) $ • • • $ SU(N r )) 

withn;=iJVi=tf. 

Moreover, it is important to note that quality functions 
/ directly defined on G/H (without resorting to equiv- 
ariance) can be handled with the very same techniques 
as above, once the relation 

grad/([G])=D(noZ G )(l)Ad G -iQ< 3 
for some J7 P G p is established, cf. Eqn. (|166[) . 



27 



IV. APPLICATIONS TO QUANTUM CONTROL 
AND QUANTUM INFORMATION 

A. A Geometric Measure of Pure-State 
Entanglement 

The Euclidean distance of a pure state to the set S pp of 
all pure product states m ay be seen as a geometric mea- 
sure of entanglement (54l . Ill7l . Ill8l ] . Since S pp coincides 
with the local unitary orbit 



OiocO/Z/ 1 ) := {Uyy^rf \ U £ SU loc (2 n )} 



(167) 



of any pure product state y £ <S PP , it relates to the fol- 
lowing optimisation task 

A(x) := min Ibx 1 - Uyy ] U ] \\ 2 , (168) 

where x £ C 2 denotes a normalised pure state and y £ 
C 2 a pure product state, e.g. y — (1,0,..., 0) T = (ei ® 
■ • • ® ei). This notation replaces |ar) by x and |a;)(x| by 
xa;^ for the sake of convenient generalisation to higher 
order tensor products. Obviously, minimising (|168|) is 
equivalent to maximising the so-called local transfer 



max 

UGSU ioc (2'- 



Re(tr(xx*UyyW)), (169) 



between xx* and yy* . Further, since 

tr(xx^ Uyy^tf) = \ tv(x^Uy)\ 2 

taking the real part in (|169p is redundant. 

Now, the techniques developed in Section fill Dl match 
perfectly to tackle problem (|169|) . Let C := A := 
diag (1, 0, . . . , 0) and define the so-called local unitary 
transfer between C and A by the real-valued function 



f loc (U) :=tr (CUAtf) 



(170) 



Then the gradient flow (|130|) or more precisely its dis- 
cretisation (|136p will gcnerically solve (|169p. For exp licit 
numerical results sec Subsection HVB] and fH3 . [TTgj . 

In general, neither an algebraic characterisation of the 
maximal value of f\ oc nor the structure of its critical 
points is known, the major difficulty arising from the 
fact that U is restricted to SU\ oc (2 n ). As soon as U may 
be taken from the entire special unitary group, the so- 
lution is well-known: it is simply obtained by arranging 
the (real) eigenvalues of both A and C magnitude-wise 
in the same order [H, HI, EH OH HH . 



systems are particularly simple in as much as the maxima 
of /i oc can be linked to the singular- value decomposition 
(svd) of the matrices X and Y associated to x and y by 
x := vecX and y := vccY. Since these ideas readily ex- 
tend to arbitrary finite dimensional bipartite systems, we 
generalise the formulation of Problem (|169[) thus leading 
to Eqn. (|17ip . before going into multi-partite systems. 

Proposition IV. 1. Let X = V X E X W X> Y = V Y E Y W Y 
be singular value decompositions with Vx,V Y £ U(iVi), 
Wx,Wy £ V(N 2 ) and Sx,Sy sorted by magnitude. 
Moreover, let x :— vccX and y := vccY . Then the 
maximum value of the local transfer between ra' and yy^ 
is bounded by 

max Rc ( trfxx^Uyy^U 11 )) 

(7GSU(A r 2 )«iSU(JV 1 ) (171) 

< (trS^-Ey) 2 . 

Equality is actually achieved for Vx,V Y £ SU(iVi), 
W x , W Y £ SXJ(N 2 ) and U* := (W* x ® V x ) ■ (W? ® V Y ). 



Proof: 

obtain 



For U 



W ® V £ SU{N 2 ) ® SU(N!) we 



(172) 



tr (xx^Uyy^U^) = 
= tr(xx\W ®V)yy\W^ ®V^)) 
= tr (xx^ vec(VYW T ) vec(VY VF T ) f )) 

= \ix(X*VYW T )\ 2 . 

Here, we have used the identities 

vec(VYW) = (W T ® V) vecF, 
(vecX)UecY = trX^Y 

for all X,Y £ C NlxN2 . Now, (JT72J) implies 

max Retr(a;x' Uyy^U^) = 
c/esu(Ar 2 )®su(JVi) 

= max \tr(X^VYW T )\ 2 < (tr£t£y) 

vesu(N 1 ) 1 ' 
wesu(jv 2 ) 



whe re the last inequality is due to von Neumann, cf. [1071 . 
[Hj. IfV x ,V Y £ SU(iVi) and Wx,W Y £ SXJ(N 2 ), equal- 
ity is assumed in Eqn. (|173[) for 



(173) 



{W Y W X ) T ® VxV Y = (W x ® V x ) ■ (W? ® V Y ). 



□ 



B. Generalised Local Subgroup Optimisation 

Bipartite Systems and Relations to Singular- Value 
Decompositions 

An exceptional case, where the restricted problem 
(I169[) can be solved are bipartite pure systems. These 



Corollary IV. 1. Set x := vecA and y := vccC. Then 
the maximum local transfer between xx* and yy' in the 
sense of Provosition UV . 1\ is bounded by 

\\A\\ 2 C := ^max \ix(tfVAW*)\ 2 , 

"Weu(iV2) 

which is known as the C -spectral norm of A, cf. llOA] . 
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Note that in the context of finding maximal distances 
between global unitary orbits for the purpose of geo metri c 
discrimination of generic n on-pure quantum states |l23l | , 
results similar to [l2ll . 1 1221 ] show up, while here we treat 
local unitary orbits of pure bipartite states as explicit in 
Eqn. (fT7l|) . 



Multipartite Systems and Relations to Best Rank-1 
Approximations of Higher Order Tensors 



Proposition IIV.1I has a straightforward generalisation 
to multipartite systems, which relates to best rank-1 ap- 
proximations of higher order tensors. To outline this rela- 
tion, we define the concept of a generalised local subgroup 



SUi oc (N 1 ,...,N r ) :=SU(JVi). 



SU(iV r ). (174) 



of type (JVi, ...,N r ) with N k G N, k = 1, . . . ,r. Thus, 
the associated general local subgroup optimisation Prob- 
lem can be stated as follows. 

Generalised Local Subgroup Optimisation 
Problem (glsop) 

For C,Ae C NxN with N :— Ni-N 2 - ■ ■ N r find 

max Re(tr(CE/AL/ t )). (175) 

£/eSi/i 00 (iVi,...,iV r ) v ' 

To our knowledge, the GLSOP seems to be unsolved so 
far. To introduce higher order tensors, we have to fix 
some further notation. For simplicity, we regard a tensor 
of order r G N as an array 

X = (-Xi 1 ...* r )l<ti<JVl 1 ...,X<i r <iV r . 

of size N\ x • • • x N r . The space of all N\ X • • • X iV r -tensors 
is denoted by C Ar i x ' XJV '-_ j± na tural scalar product for 
tensors of the same size is given by 



(Y\X) := J2 K...i r Xi,. 



(176) 



Let || ■ || denote the norm induced by scalar product (11761) . 

For X G C JVlX '" xJVr solve 



mm 

crec, ||x*u 
*=i > 



\X-C-x 1 



® a x 



(178) 



Note that the above notation ® a is necessary to distin- 
guish between two different types of outer products: the 
Kroneckcr product (g> (of column- vectors) , which maps r- 
tuplcs of column- vectors to a column- vector of larger size, 
and the 'abstract' outer product <£> a , which maps r-tuples 
of column-vectors to arrays (= tensors) of order r. The 
relation between both is given by the canonical isomor- 
phism Dec : C Nl *-* N * -► with N := N 1 -N 2 ...N r , 
which is uniquely determined by 



x 1 <X> a x 2 



x 1 ® x 2 



®---®x r 



(179) 



i.e. oec assigns to each array X G C lX "' x r a column- 
vector in C N by arranging the entries of X in a lexico- 
graphical order. With these notations at hand, the rela- 
tion between glsop and brap can be stated as follows. 

Theorem IV. 1. Let X G C^ 1 *"*^ be a tensor of or- 
der r and let x := Vec(X) G C N with N := JVi- N 2 - ■ ■ N r . 
Then the BRAP is equivalent to the GLSOP 



max 

uesUi oc (N u 

-N 



Re (tr (xx^Uyy^U^)), 



(180) 



where y G C can be any pure product state, e.g. y 



(1,0,. ..,0)' = ei 

(a) If XJ X ® ■ ■ ■ (8 
f/fcei, k = 1 
solve (775p . 



g) • • • (g) e±. More precisely, 

U r is a solution of M80\) then x k := 
. . . , r and C := {X\x x <X> a • • • <8> a x r ) 

(b) IfCeC and x k , k = 1, . . . , r solve {TTSp then any 
U\® • • ■ ®U r with x k — Uke±, k = 1, . . . ,r yields a 
solution of 

For proving Theorem lIV.ll we need the following technical 
lemma. 

Lemma IV. 1. The pair (x 1 ® a ---® a x r ,C) solves {773p 
if and only if x 1 <8> a • • • <8 a x r is a maximum of 



max 
=i,fc=i, 



\iX\z 1 ® a ■ ■ ■ ®a Z r }\ 



(181) 



Moreover, a tensor X is called a rank-1 tensor if there 
exist x k G , k = 1, . . . , r such that 



X = X 1 ® a X 2 ® a - ■ ■ ® a . 



(177) 



where the (i\ . . . i r )-entry of the outer product ® a is de- 
fined by 

(x <8 a X ® a • ■ • C^a X )i 1 ...7 r '■— X^ 1 • X^ • • • . 

Thus, the question of decomposing a given tensor by 
tensors of lower rank leads to the following fundamental 
approximation problem: 

Best Rank-1 Approximation Problem (brap) 



and C = {X\x x ® a ■ ■ ■ ® a x r ). 

Proof: Consider the following identity 



\X -C-z 1 



= \\x\ 



|C| 2 -2Re(C*(X|z 1 



\X\\ 2 +\C-(X\z 1 ® a ---® a z r }\ 



-\iX\z 1 
Thus, we obtain 



mm 

C€C,||«fe||=l 



= \\XP- 



\x 



%.z r )\ 



C-z 1 



la Z T )) 



max 

||**>||=1 

k = l,...,r 



\iX\z 1 
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This yields the desired result. 



Proof of Theorem IIV.lt Let y = e\ (8> • • • £ 

Then 

(Ui <g> • • • (8) ?7 r )j/ = (t/iei) <g> • • • «) (t/ P ei) 
and thus 

tr(xx* UyyW) = tr(a; t C/yy t C/ t a;) 

= \(X\(U iei )® a ■■•<g> a (C/^ex))! 2 
Therefore, we obtain 

max Re (tr^a^t/yu^C/^)) = 

I7£Sl7 1 oo(JVi,...,JV r ) v v 

max |(^|(^iei)® a ■ ■ • <8 a (E/ r ei)}| 



□ 
ei- 



max kXjz 1 (g) a • • • <8> a z r }\ 2 . 



and hence Lemma HV. 1 1 implies (a) and (b). 



□ 



Remark IV. 1. 1. The isomorphism Dec coincides 
'almost' with the standard vec-operation on ma- 
trices for r = 2, more precisely titc(X) = vec(X T ). 

2. Since any phase factor can readily be absorbed into 
3 a x r , it is easy to show that 



X 1 <X> a 



max 

=i,fc=i, 



\iX\x 1 ®a---®aX r )\ 



max Re ((Xlx 1 (g) a ■ • • <g> a x r )). 

=l,fe=l,...,r 



Therefore, maxima of the 'real-part-expression' on 
the right-hand side are always maxima of the 
'absolutc-value-term' on the left. 

3. By replacing yy< in (|180[) with an appropriate sum 

X)i=i Villi } ^ ne aDOVC ideas can be extended to best 
approximations of higher rank, i.e. to best approx- 
imations of the form 



min \\X - Y] d ■ x 1 ' 1 (g> a ■ ■ ■ (g> a x l ' r \ 



with I < minjiVi, . . . , N r } and all x 1 ' 1 ® a • • • <8fa x l 



mutually orthogonal, cf. |l24| 



4. Unfortunately, an analogue of Proposition [IVJ] in- 
volving the tensor SVD as defined in [l25| | does not 
hold for higher order tensors. Even the classical 
Eckart- Young Theorem, which asserts that the best 
rank-fc approximation of a matrix is given by its 
tru ncat ed SVD, is false for higher order tensors, 
cf. fl26l. 



Numerical Results 

For comparing our gradient-flow approach to tensor- 
SVD techniques, here we focus on two examples that are 
well-e stabl ished in the literature, since analytical solu- 
tions [127] as well as numerical results from semidefi- 
nite programming are known [54| . First, consider a pure 
3-qubit state depending on a real parameter s G [0, 1] 



(182) 



and 



\X(s)):=^\W)+Vl=~s\V) , 
where one defines 

\W) := -L(|001) + |010) + |100)) 
\V) := ^(|110) + |101) + |011 



with the usual short-hand notation of quantum informa- 
tion |0) := (i), |1) := (?) and |001) := (J) ® (J) ® (?) 
etc. With these stipulations one finds the corresponding 
2x2x2 tensor representations for \W) and \V) to take 
the form 



and 
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V3 



1 

1 




1 



W(2,;,:) = -4 



1 
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v, 



(2.: 



1 



1 
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Likewise, observe the pure 4-qubit-state 
\X(s)) := yG\GHZ') - VT~1 \X+) <g> 
with the definitions 



IX- 



(183) 



(184) 



(185) 



IGHZ') 
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i 

V2 



|0011) + |1100) 



\X+) := -U|10> + |01) 



Consider the target function f(K) = tr{C t KAK^} 
with C = diag (1, 0, 0, . . . , 0) and A := \X(s))(X(s)\. As 
shown in Fig. O with the gradient flow restricted to the 
local unitaries K e SU\ oc (2 n ) one obtains resu lts per- 
fectly matching the analytical solutions of |127| as well 
as the numerical ones from semidefinite programming en- 
suring global optimality — yet in drastically less CPU time 
as compared to [54|, see Table Q] Gradient flows are some 
30 to 150 times faster in CPU time than semidefinite pro- 
gramming methods for the 3-qubit and 4-qubit example, 
respectively. 

In the tensor-SVD algorithms [l26l ] such as the higher- 
order power method (hopm) or the higher-order orthogo- 
nal i terat ion (hooi) as implemented in the MATLAB pack- 
age [l28| . N = 50 to N = 60 iterations are required for 
quantitative agreement with the algebraically established 
results. In the 3-qubit example, all minimal distances are 
also reproduced correctly with N = 5 iterations-except 
for the limiting values s near and near 1, for which the 
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Table I: CPU Times for Determining the Euclidean Distance to the Orbit of Separable Pure States as in Fig. [5] 



no. of 


semidefinite 


gradient flow 


tensor-SVD I (hopm) [126. 1281 


tensor-SVD II (hooi 


) [126, 128] 


qubits 


programming 


on local unitaries 


higher-order power method 


higher-order orthogonal iteration 




CPU time [see]" 


CPU time [sec] 6 speed-up 


CPU time [sec] 6 speed-up 


CPU time [sec] 6 


speed-up 


3 


10.92 


0.30 36.4 


2.39 4.6 


5.37 


2.0 


4 


103.97 


0.71 147.0 


3.93 26.5 


7.03 


14.8 



"Eisert et al. (processor with 2.2 GHz, 1 GB RAM) [53 
6 average of 50 runs, Athlon XP1800+ (1.1 GHz, 512 MB RAM) 
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Figure 5: (Colour online) Numerical results by gradient 
flow on the local unitary group K = SUioc{2 n ) determin- 
ing (a) the Euclidean distance of the 3-qubit state l-X'(s)) = 
\fs\W) + \f\ — s\V) (see Eqn. I182p to the nearest product 
state as a function of s; (b) the distance of 4-qubit state 
l-^M) = v^lGHZ') - v / T^|X+) ® |X+)(see Eqn. \W$ to 
the nearest product state. 



minimal distances of A(\X{0))) = A(|X(1))) = 2/3 arc 
obtained by either tensor method instead of the correct 
analytical value of 5/9, which requires N = 60 itera- 
tions as shown in Fig. [SJ In the 4-qubit example, how- 
ever, for N = 5 iterations, both tensor methods suffer 
from apparently random numerical instabilities, which 
only vanish when allowing for N — 50 iterations in cither 
method. It is the considerably high number of iterations 
that makes the tensor methods substantially slower than 
our gradient-flow algorithm as shown in Tab. U 

Therefore, at least for lower order tensors, gradient 
flows provide an appealing alternative to tensor-SVD 
methods for best rank-1 approximations of higher order 
tensors. Moreover, one should take into account that 
the above gradient methods are developed to solve the 
GLSOP and thus a considerable speed-up can be expected 
by adjusting them to the local orbit O\ oc (yy ') of a pure 
product state. For a similar result obtained by an in- 
trinsic Newton and conjugated gradient method see also 
jll9j ] . — We anticipate that these numerical approaches 
will prove useful tools in tensor and rank aspects of en- 
tanglement and kinem atics of qubit pairs as addressed, 
e.g., in Refs.dMIIIi 
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Figure 6: (Colour online) Tensor-SVD results for Euclidean 
distance of the 3-qubit state \X(s)) to the nearest product 
state as in Fig. EJa). With the standard of N = 5 iterations, 
both methods (here shown for hopm) give systematic errors 
as indicated by the arrow. iV = 60 iterations are needed 
for quantitatively matching the well-established distance val- 
ues. The high number of iterations required slows down the 
method as indicated in Tab. [T] 



C. Locally Reversible Interaction Hamiltonians 

Joint Local Reversibility 

In a recent study [28| . we have addressed the deci- 
sion problem whether a time-independent (self-adjoint) 
Hamiltonian H normalised to ||-ff|| 2 = 1 generates a 
one-parameter unitary group U(t) = {e~ ltH \t £ M.} that 
is jointly invertible for all t by local unitary operations 
K e SU loc {2 n ) = SU(2)® n in the sense 



KHK^ = -H 



(186) 



Apart from complete algebraic classification, in [28|| we 
used that the question obviously finds an affirmative an- 
swer, if there is an element K £ SU\ oc (2 n ) such that 

\\KHK*+H\\ 2 = , (187) 

which amounts to minimising the transfer function 

f{K) = Re %i{HKHK^} . (188) 
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With P denoting the projector onto t, i.e. the Lie algebra 
of K = SUi oc (2 n ), we therefore used the gradient flow 

K = -gcadf{K) = -P{[KHK\H\) K (189) 

as an other application of Theorem IIII.15I If (due to 
normalisation) Re tr{HKHK'} = — 1 can be reached, 
the interaction Hamiltonian is locally reversible. 

Remark IV. 2. There is an interesting relation to loca l 
C-n umerical ranges as described in detail in Refs. [l09l 
IllOj : if the local C-numerical range 

W ioc {H,H) := {tiiHKHK- 1 )^ £ K} = [-1;+1] 

then the interaction Hamiltonian H is locally reversible. 
The references also establish the interconnection to lo- 
cal C-numerical ranges of circular symmetry and multi- 
quantum interaction components transforming like irre- 
ducible spherical spin tensors. 

In Fig. [7J we give some examples: e.g., the Hciscn- 
berg ZZ interaction in a cyclic four-qubit coupling topol- 
ogy is locally reversible, while in the cyclic three-qubit 
topology or for the isotropic XXX interaction it is not. 
Thus numerical tests provide convenient answers in prob- 
lems where an algebraic assessment becomes more te- 
dious than in these examples, which are fully understood 
on algebraic grounds [28|. 

Pomtwise Local Reversibility 

In [28| we also generalised the above problem to the 
question, whether for a fixed t 6 1 there is a pair 
Ki, K 2 e K = SU loc {2 n ) so that 

K ie - ZTH K 2 = e +trH (190) 
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Figure 7: (Colour online) Gradient-flow driven local reversion 
of different Heisenberg interaction Hamiltonians: (a) the ZZ 
interaction on a cyclic four-qubit topology Ci can be locally 
reversed, (c) the ZZ interaction on a cyclic three-qubit topol- 
ogy C3 cannot be reversed locally, (c) nor the XXX isotropic 
interaction between two qubits. 



which upon setting A := e %tH and C := e +lrH is equiv- 
alent to 

\\K r AK 2 -C\\ 2 = . (191) 

Thus one may choose a gradient flow to minimise 

f{K u K 2 ) := Re tr{C^K 1 AK 2 } (192) 

by the coupled system 

K x = gr a df(K 1 )=P(K 1 AK 2 tf)K 1 (193) 
K 2 = grad f(K 2 )=P(K 2 C^K 1 A) K 2 . (194) 

So if f(K\,K 2 ) = -1 can be reached, then U(t) = e~ lrH 
is locally reversible at time t = r. See Fig.[8]for examples 
comparing pointwise and universal local reversibility. 



D. Constrained Optimisation in Quantum Control: 
Intrinsic vs Penalty Approach 

In practical quantum control, one may face the prob- 
lem to maximise a quality function / on the reachable set 
of a quantum system under additional state space con- 
traints. For instance, find the maximal unitary transfer 
from matrix (state) A to C subject to leaving another 
state E invariant (provided A and E do not share the 
same stabiliser group). Another variant amounts to op- 
timising the contrast between the transfer from A to C 
and the transfer from A to D; so the task is to max- 
imise the transfer from A to C subject to suppressing 
the transfer from A to D. 

For tackling those types of problems, we address two 
basically different approaches — a purely intrinsic one and 
a combined method joining intrinsic and penalty-type 
techniques. Both methods will be briefly illustrated for 
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Figure 8: (Colour online) Gradient-flow driven local inversion 
of exponential of the Hamiltonian H = \ {o~z <8> 1 + 1 ® cr z + 

o~ z ® (Tz) and U(r) :— e~'^ H (a) by a gradient flow with K\ 
and K2 (b) by a gradient flow with K\ = K\ =: K. 
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Summary: General Gradient Algorithm for Steepest Ascent on Riemannian Manifolds 

Requirements: Riemannian manifold M, e.g. Lie group G with (bi-invariant) metric (-|-) or its group orbits; smooth 
target function / : M — > R; associated gradient system X = grad/(X). 

Input: initial state X(0) £ M, parameters for target function. 

Output: sequence of iterative pairs {{Xk, f(Xk))} approximating critical points X, and their critical values f(X*). 

Initialisation: If possible, gen erate generic initial state Xo, e.g. for compact Lie groups pick random Go £ G according to 
Haar measure (for SU(N) see [l3ll |) and set Xq := Gq-X(0), otherwise identify Xq := X(0); calculate f(Xo), grad/(Xo), 
and step size ao according to Section Hill 

Recursion: 

while k — 0, 1, 2, . . . , fc Umit and a k > a thlcshold > do 

1: iterate Xk+i = exp Xk (at grad f(Xk)) (see collection of examples in Tab. HH . 
2: calculate f(Xk+i) ■ 

3: update step size ctk+i according to Section [TTT1 
4: go to step 1. 

end 



Figure 9: Summerising scheme for steepest-ascent gradient flows on Riemannian manifolds. For related methods, like conjugate 
gradients, Jacobi- or Newton-type schemes, step (1) has to be modified in a straight-forward way according to Sec. [TTJ for 
details see Refs. [3, loll . |62| . If the dynamic stepsize selection of Sec. Mil is too costly CPU-timewise, one may start out with 
constant stepsizes, and halve them whenever (f(Xh+i) — f(Xt)) < 0, cf. Armijo's rule. In cases, where local extrema exist (see 
Sec. IIII[1 . make sure to run with a sufficient number of generic initial conditions. 



the problem of maximising the transfer from A to C while 
leaving E invariant, i.e. 

^max. \te{UAU ] &}\ subject to UEW = E. (195) 

It is straightforward to see that the stabiliser group 

K E := {K G U{N) | KEK* = E} (196) 

of E forms a compact connected Lie subgroup of U (N) . 
Differentiating the identity e tk Ee~ tk = E for t = yields 
its Lie algebra 

t E := {k G u(N) | ad fe (^) = [k, E] = 0}. (197) 

By the Jacobi identity 

[[A,B],E] + [[B,E},A} + [[E,A},B} = (198) 

for all A, B, E e C NxN , one can easiliy verify that 6g is a 
Lie subalgebra of u(N). Moreover, from the compactness 
of Kb we conclude that not only does Ie generate the 
stabiliser group in the sense (exp ({!#)) = Kb, but 
also does so in the stronger sense exp(ts) = Kg. 

A set of generators of Ie may constructively be found 
by solving a system of homogeneous linear equations, i.e. 

l E = kerad B nu(iV) 

(199) 

= {k e u(N) I (1 (g) E - E T <E> 1) vec(fc) = 0} . 

In particular, if E is of the form E = fit + fi with \i € C 
and SI G u (-^0j then t E is identical to the centraliser of 
Q in u(iV). 



By ortho- normalising the elements kj G %e of the gen- 
erating set is with j = 1,2,..., he, one obtains the pro- 
jectors Pj := \kj)(kj \ according to Eqn. (|124[) to give the 
total projection operator P := Y^j Pj- With this defini- 
tion, the gradient flow U2K of the summarising Table ILT1 
applies and solves Eqn. (|195p . Therefore, the constraint 
of leaving a neutral state E invariant during the transfer 
from A to C can be approached intrinsically by restrict- 
ing the flow from the full unitary group to a compact 
connected Lie subgroup, the stabiliser group Kg of E. 

However, since it may be tedious to check for the sta- 
biliser group Kb in each and every practical instance 
and then project the gradients onto the corresponding 
subalgebra Ie, a more versatile programming tool would 
be welcome. In [27| . we therefore presented a combined 
approach based on the Lagrangc-type function 

L(U) = f 2 (U) - A (ix{E*UEUi} - \\E\\l) (200) 

with f 2 (U) := ItriC^UAUm 2 . and penalty term 
A (tr{E^UEW} - \ \E\\l). Here, the constraint UEW - 
E = was rewritten in the more convenient form 
tr{E^UEW} - \\E\\j = 0. The algorithm given in Ta- 
ble [II] as U2C implements a discretised gradient flow of 
L obtained from the identity 

DL(U) (W) 

= tr { (2 (f 2 (U)* ■ [UAU\ Ct]) 5 _ \[UEU\ &]) f>} . 

Note that the penalty parameter A is increased within 
the recursion to guarantee that the constraint is (at least 
approximately) satisfied in the limit. 
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Table II: Examples of Optimisation Tasks and Related Gradient Flows 



no. target function discretised gradient flow Ref. 



I. Unconstrained Optimisation 



maximisation over the orthogonal group: O G SO(N, R) and A, A € R iVxJV with A diagonal, a k > stepsize 
Of f(0) = tr{A T OAO T } O k+1 =e X p{-a k [O k AO]:,A T ]}O k [16,21,22] 

maximisation over the unitary group: U,V G SU(N) and A,C € C NxN ; [•, -]s and (-)s denote skew-Hermitian parts 

Ul /([/) = Re tr{C t [/4i7 t } C/ fc +i = exp{-a fe [f/ fe Af/^ C + ]s}t4 [26,27] 

U2 /([/) = | triCtf/At/t}] 2 [/ fc+ i =exp{-Q fe ([yl fc ,C ,t ]/*(^)-[^,C t ] t /(^))}^ [26,27] 

where A* := U k Alll 

U3 /(C/,V) = Retr{C* t C/A\/} Eft+i = exp{-a h (U k AV k tf)s}U h [22,28] 

V fc+1 = exp{-/3 fc (V fe C t [/ fc yl)s}^ 

maximisation restricted to subgroups K C U(N) of the unitary group 

with KeK and Pe as projection from jj[(JV, C) onto £, i.e. the Lie algebra to K 

U1K f(K)—RetT{C^KAK' ] '} K k+1 = ^p{-a k P t {K k AK I C*]}K k [here"] 

U2K f{K) = \tT{C*KAKi}\ 2 K k+1 = exp{-a k (P t [A k ,&]f*{K k ) - P t [A k ,C^ f(K k ))}K k [here a ] 

where A k := K k AK ] k 

U3K f(Ki,K 2 ) =Retr{C t K 1 AK 2 } K ( k % = exp{-a fe P, (JC^AfC^ 2 ) C t )}K ( h 1) [28] 

= expi-AP^^f^t^'A)}^ 2 ) 

maximisation restricted to homogeneous spaces G/H of the orthogonal group 
with X G G/H and A, C real symmetric 

OIP f(X) = tr{CX} with X fc := Ado k (A) X k+1 = e - Q <=[ x <- c l X k e +a >> [x >" c] [21, 1151 

maximisation restricted to homogeneous spaces G/H of the unitary group 

with X G G/H and A, C arbitrary complex square and p. as projection from nl(iV, C) onto 6 

U1P f(X) = Retr^X} with X k := Ad Uk {A) X k+1 = e -<*fcW=> ct ].s jsf fc e +Qfc[Xfe ' c ' tls [here] 

U1KP /(X) = Retr^X} with X k := Ad Kk (A) X k+1 = e -a * p *( x * lCt] X k e +akPt[Xk ^ ] [here] 

II. Constrained Optimisation 

maximising L(U) with penalty parameter A G R over the unitary group: U G SU(N); A, C',D,E G 



-JVxJV 



U1C L(C7) =Re/o(f7)- AIm 2 / c ((7) C4+i = exp{-a fe ([A k , C f ] s + 2i\ lmf c (U k )[A k , C%)}U k [27] 

with f c (U) := tv^UAU 1 } where A k := U k AU ] k and X^.s := |(X ± X f ) 

U2C i((7) = |/c(f/)| 2 -A(/ B (t/)-||i?||I) f/ fe+ i=exp{^a fc ((2/a(C/ fc )[A.,C t ]) s -A[P fe ,pt]) }f/fc [27] 

/c(f/) (s.a.) and / B ([7) := tr{P t C/£'(7 t } where A fc := U k AU\ and P fc := t/fcPl^ 

U3C L([/) = |/c([/)| 2 -A|/ D ([/)| 2 U k+1 =exp{-2a k {(fZ(U k )[A k ,&))s-\(fUU k )[A k ,Di]) s )}U k [2JJ 

fc(U) (s.a.) and / D (C7) := tr{DWAW} where A& := C/fcAt/t 



"work presented in part at the MTNS 2006 [11 



Thus, for the constrained optimisation task of max- 
imising the transfer from A to C subject to leaving the 
state E invariant, one has the choice of taking either 
the intrinsic approach U2K or the combined approach of 
U2C. Note, however, that the intrinsic approach restricts 
the flow to the stabiliser group K# at any time, whereas 
the combined method is designed such as to start arbi- 



trarily on U (N) but finally to give an equilibrium point 
onK^. Therefore, the intrinsic approach has the advan- 
tage that the constraint is (at least in principal) properly 
satisfied for the entire iteration. However, there are sit- 
uations where an intrinsic method is impractical as the 
computational costs are too expensive. The combined 
method, in contrast, does not suffer from this shortcom- 
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ing and thus has a wider range of applications. Note 
that the intrinsic approach paves the way to perform (or 
approximate) a transfer from A to C robustly by tak- 
ing Kb as the stabiliser group resistent against a cer- 
tain error class in the sense familiar from stabiliser codes 
[l32l . 1 1331 . [l34]. The extrinsic approach, on the other 
hand, could be taken to transfer one protected state A 
to another one C via intermediate states that are no 
longer necessarily protected against errors as in the in- 
trinsic case. 

Finally, in [27|, Il09| | , we devised a penalty- type gradient 
flow algorithm for solving the constrained optimisation 
problem 

max I tr{C f UAU f }\ subject to tr{£) t t/At/ t } = min . 

To this end, we introduced the Lagrangc-type function 

L(U) := I tr{tf UAtf}\ 2 - A | tr{D^UAU^}\ 2 , 

to maximise the transfer from A to C while suppressing 
the transfer from A to D. This leads to the recursive 
scheme U3C in Table HU 

V. CONCLUSIONS 

The ability to calculate optima of quality functions for 
quantum dynamical processes and to determine steerings 
in concrete experimental settings that actually achieve 
these optima is tantamount to exploiting and manipulat- 
ing quantum effects in future technology. 

To this end, we have presented a comprehensive ac- 
count of gradient flows on Riemannian manifolds (see 
general scheme of Fig. [5]) allowing for generically conver- 
gent quantum optimisation algorithms — an ample array 
of explicit examples being given in Tab. [Til Since the 
state space of quantum dynamical systems can often be 
represented by smooth manifolds, the unified foundations 
given are illustrated by many applications for numeri- 
cally addressing two categories of problems: (a) we have 
focussed on abstract optimisation tasks over the dynamic 
group, and (b) we have sketched the relation to optimal 
control tasks in specified experimental settings. 

In the present work on closed systems, a variety of ap- 
plications are addressed by relating the dynamics to Lie 
group actions of the unitary group and its closed sub- 
groups. Since symmetries give rise to stabiliser groups, 
particular emphasis has been on gradient flows on homo- 
geneous spaces. Theory and algorithms have been struc- 
tured and tailored for the following scenarios: 

(i) for Lie groups with bi-invariant metric, 

(ii) for closed subgroups 

(iii) for compact Riemannian symmetric spaces, 
or, more generally, 

(iv) for naturally reductive homogeneous spaces. 



As soon as the homogeneous spaces are no longer natu- 
rally reductive, the 'usual' way of obtaining geodesies on 
quotient spaces (by projecting geodesies from the group 
level to the quotient) fails. Alternatives of local approx- 
imations have been sketched in these cases in order to 
structure future developments. 

Techniques based on the Riemannian exponential are 
easy to implement on Lie groups (with bi-invariant met- 
ric) and their closed subgroups. In particular, gradient 
flows on subgroups of the unitary group allow to ad- 
dress different partitionings of m-party quantum systems, 
the finest one being the group of purely local operations 
SU{2)®SU{2)®- ■ -®SU{2). The corresponding gradient 
flows have several applications in quantum dynamics: for 
instance they prove useful tools to decide whether effec- 
tive multi-qubit interaction Hamiltonians generate time 
evolutions that can be reversed in the sense of Hahn's 
spin echo solely by local operations. 

As a new application, gradient flows on SU{N\) ® 
SU(N 2 ) ® ■ • • ® SU(N m ) turned out to be a valuable 
and reliable alternative to conventional tensor-SVD meth- 
ods for determining best rank-1 tensor approximations to 
higher-order tensors. In the case of m-party multipartite 
pure quantum states, they can readily be applied to op- 
timising entanglement witnesses. 

Double-bracket flows have been characterised as a spe- 
cial case of a broader class of gradient flows on naturally 
reductive homogeneous orbit spaces. Here, in view of us- 
ing gradient techniques for ground-state calculations [36| , 
it is important to note that double-bracket flows can also 
be established for any closed subgroup of SU(N): by al- 
lowing for different partitionings SU {Ni)®SV (A^)®- • -<g> 
SU(N m ), one may set up a common frame to compare 
different types of unitary networks [H, 49] for calculating 
and simulating large-scale quantum systems. 

Moreover, we have shown how techniques of restrict- 
ing a gradient flow to subgroups also prove a useful tool 
for addressing constrained optimisation tasks by ensur- 
ing the constraints arc fulfilled intrinsically. As an al- 
ternative, we have devised gradient flows that respect 
the constraints extrinsically, i.e., by way of penalty- type 
Lagrange parameters. These methods await application, 
e.g., in error-correction and robust state transfer. 

Finally, in a follow-up study, we discuss the dynam- 
ics of open quantum systems in terms of Lie semigroups 
[58| . We sketch relations between the theory of Lie semi- 
groups and completely positive semigroups. In particu- 
lar in open systems, an easy characterisation of reach- 
able sets arises only in very simple cases. It thus poses 
a current limit to an abstract optimisation approach on 
reachable sets. However, in these cases, gradient-assisted 
optimal control methods again prove valuable. 

Therefore, not only does the current work give the jus- 
tification to some recent developments, it also provides 
new techniques to the field of quantum control. It shows 
how to exploit the differential geometry in Lie theoreti- 
cal terms for optimisation of dynamics on quantum-state 
manifolds. Thus the comprehensive theoretical treat- 



35 



ment illustrated by known examples and new practical 
applications has been given to fill a gap. — We anticipate 
the ample array of methods and their exemplifications 
will find broad application. The account of theoretical 
foundations is also meant to structure and trigger fur- 
ther basic research thus widening the set of useful tools. 



VI. APPENDIX SECTION 

Appendix A: Frechet Differentials and Tangent 
Maps 

For basic differential geometric terms and definitions 
we refer to [86|. Here, we recall only the fundamental 
relation between Frechet derivatives and tangent maps 
for finite dimensional smooth manifolds. 

Definition VI. 1 (Frechet derivative). Let Ti and Ti' be 

real or comlpex Hilbcrt spaces. Let F : U — > Ti' be a map 
defined in some open neighbourhood U C Ti of X. Then 
F is (Frechet) differentiable in X, if there exists a linear 
operator DF(X) : Ti — > Ti' and a map rx ■ Ti — > TL' such 
that 

F(X + A) - F(X) = DF(X)A + r x (A)\\ A|| (201) 

for A e Ti with X + A<E U and ||r x (A)|| -> for ||A|| -> 

0. The linear operator DF{X) is called the (Frechet) 
derivative of F in X and F is said to be smooth, if it is 
Frechet differentiable of any order for all X £ U. 

Remark VI. 1. (a) It is more common to stress the 
term 'Frechet' derivative in an infinite dimensional 
setting to distinguish from other non-equivalent dif- 
ferentiability notions, cf. [135]. In finite dimen- 
sions, however, DF(X) is often simply called 'the' 
derivative of F. 

(b) Note, that if DF(X) is complex linear for all X G 
U, then F is immediately holomorphic, i. e. F can 
be locally expanded in a power scries, cf. [l35| |. 

Now, let M be a smooth manifold. If M is embedded 
in a Hilbert space Ti, then the tangent space T X M to M 
at the point X is the set of all tangent vectors £ at X, 

1. e. the set of all 'velocity' vectors 



? =d7^ 



t=0 



of smooth curves in M through X = 7(0). If M is not 
embedded in a Hilbcrt space Ti, the situation is slightly 
more complicated. Then, TxM is defined as the set of 
all equivalence classes £ = [7] of smooth curves 7 in M 
through X = 7(0) having the same 'velocity' at t = 0, 
i.e. 7 and 7 represent the same equivalence class £ if and 
only if 



for one and thus for all charts <f> at X. These equivalence 
classes £ are called tangent vectors. Note that either way 
the tangent space TxM is a vector space isomorphic to 
the chart space. 



Definition VI. 2 (Tangent map), 
smooth manifolds. Take f : M — 
map. 



Let M and N be 
N as a continuous 



(a) Then / is called smooth, if ifs o / o < 
all admissible charts cf) and ip. 



is smooth for 



(b) If / is smooth, the linear map Df{x) : TxM — > 
T f(x)N given by 

€ = [7]-£>/W(0 := Lf°7] 

is called the tangent map to / : A/ — > 2V at X € M. 

Therefore, differentiability on manifolds is locally ex- 
pressed in terms of charts. The associated tangent maps 
obey the standard rules of differential calculus, like the 
chain rule, etc. 

Finally, we quote two useful results for computing tan- 
gent maps in the case of embedded manifolds. 

Fact 1. If N is embedded in a Hilbert space H, the above 
definition of the tangent map reads 



e-D/P0(0 = ^(/o 7 )(t) 



where 7 denotes a representative of the tangent vector £. 

Fact 2. Let H, W be Hilbert spaces and let M C H, 
N C TL' be embedded submanifolds. Furthermore, let 
F : H -»■ W be a smooth map with F{M) C N. Then 
the tangent map of / := F\m : M — > N at X is given by 
the restriction of the derivative DF(X) to the tangent 
space of M at X, i.e. 



Df(X) = DF{X)\ TxM : T X M 



T f(X )N. 



Appendix B: Polarisation Procedure 

What is meant by the term polarisation procedure? Be- 
low, we give a short description of the concept and a 
sketch of its proof. 

Let TC be a real Hilbcrt space and /3:H^la bounded 
quadratic form, i.e., there exists a bounded symmetric bi- 
linear form B : Ti x Ti — > M such that 



p(v) = B(v,v), 



(202) 



for all v <E Ti. 



t=o di 



Claim A 1. Let (3 be a bounded quadratic form. The 
bounded symmetric bi-linear form B which satisfies h20ty) 
is uniquely determined by (3. 
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Proof. By the symmetry and bi-linearity of B we 
have 

B(v + w,v + w) = B(v, v) + B(w, w) + 2B(v, w) 
and hence 

B(v,w) = — (j3(v + w, v + w) — B(v, v) — B(w, w)^ 

= l(p(v + w)-p(v)-/3(w)) (203) 

for all v,w G Ti.. Therefore, B is uniquely determined by 
the quadratic form (3. □ 

The above identity (|203|) is frequently called the law of 
polarisation. Next, we show that (3 can always be repre- 
sented by a selfadjoint linear operator. 

Claim A 2. For any bounded quadratic form (3 there 
exists a unique bounded selfadjoint linear operator B : 
TL — > Ti such that 

(v\Bv)=P(v), for all v eTt. (204) 

Proof. A straightforward application of the Riesz 
Representation Theorem yields that any bounded sym- 
metric bi-linear form B on Ti can be r epre sented by a 
bounded selfadjoint linear operator, cf. [136J. From this 
the result follows immediately □ 



Remark A 1. If (3 is any form such that (3(tv) = t 2 /3(v) 
and /3(v) < K\\v\\ 2 is satisfied for all i G K and v G Ti, 
then (3 is in general not induced by a bounded symmetric 
bi-linear form, or in other words, (|203[) does in general 
not define a bounded symmetric bi-linear form. However, 
if (3 meets the parallelogram identity, i.e. 

(3{v + w) + (3(v -w) = 2f3{v) + 2(3{w) (205) 

for all v,w G Ti, then (|203|) does define a bounded sym- 
metric bi-linear form. 
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